From 5b08627eaaf449341718a56467770e625f510e41 Mon Sep 17 00:00:00 2001
From: Sean Whalen <44679+seanthegeek@users.noreply.github.com>
Date: Wed, 20 May 2026 19:29:09 -0400
Subject: [PATCH] Split tests.py into per-module tests/test_<module>.py (#774)
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

* Split tests.py into per-module tests/test_<module>.py

The 5174-line tests.py monolith is split into per-module files under
tests/, mirroring the checkdmarc layout:

  tests/test_init.py          parsedmarc/__init__.py parsing surface
  tests/test_cli.py           parsedmarc/cli.py + config / env-vars / SIGHUP
  tests/test_utils.py         parsedmarc/utils.py (DNS, IP info, PSL, etc.)
  tests/test_webhook.py       parsedmarc/webhook.py
  tests/test_kafkaclient.py   parsedmarc/kafkaclient.py
  tests/test_splunk.py        parsedmarc/splunk.py
  tests/test_syslog.py        parsedmarc/syslog.py
  tests/test_loganalytics.py  parsedmarc/loganalytics.py
  tests/test_gelf.py          parsedmarc/gelf.py
  tests/test_s3.py            parsedmarc/s3.py
  tests/test_maps.py          parsedmarc/resources/maps/ maintainer scripts

The split is purely a redistribution — no test bodies changed, no tests
added or removed. All 276 existing tests pass under the new layout.

The current tests.py contains two kitchen-sink classes (`Test` at line 54
and `TestEnvVarConfig` at line 2360) holding tests that span many
modules. Their methods are routed to the correct per-module file by name
prefix; the wholly-thematic classes (TestExtractReport, TestUtilsXxx,
TestSighupReload, etc.) move whole. Each target file gets its own
`class Test(unittest.TestCase)` for the redistributed kitchen-sink
methods, plus the thematic classes verbatim.
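
The prefix routing described above could look roughly like the sketch
below. The prefix table, the fallback target, and the `route_test`
helper are hypothetical and exist only for illustration; the actual
mapping used for the split is not recorded in this message.

```python
# Hypothetical prefix table: the real mapping used for the split is not
# recorded here. Entries are checked in order, so more specific prefixes
# should be listed before shorter ones.
PREFIX_TO_FILE = [
    ("testExtractReport", "tests/test_init.py"),
    ("testBase64", "tests/test_utils.py"),
    ("testPSL", "tests/test_utils.py"),
]

# Assumed fallback for methods that match no prefix.
DEFAULT_FILE = "tests/test_init.py"


def route_test(method_name: str) -> str:
    """Return the per-module file a kitchen-sink test method would move to."""
    for prefix, target in PREFIX_TO_FILE:
        if method_name.startswith(prefix):
            return target
    return DEFAULT_FILE
```

Under these assumptions, `route_test("testPSLDownload")` resolves to
`tests/test_utils.py`, while an unmatched name such as
`testAggregateSamples` falls through to `tests/test_init.py`.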

Wiring updates:
- `.github/workflows/python-tests.yml`: `pytest ... tests.py` →
  `python -m pytest ... tests/` (also switches to `python -m pytest` per
  the checkdmarc convention so cwd lands on the project root).
- `pyproject.toml`: adds `[tool.pytest.ini_options] testpaths = ["tests"]`
  and `[tool.coverage.run] source = ["parsedmarc"]` with an `omit` for
  `parsedmarc/resources/maps/*.py`. The maps scripts are maintainer-only
  batch tooling that is left out of the wheel; excluding them from
  coverage makes the headline number reflect only installed library
  code. Runtime coverage on the new layout is 59% (was 45% with maps
  counted), and PR-B will push it to 90%+.
- `AGENTS.md`: documents the new layout and how to run individual files
  / tests; tells future contributors not to reintroduce a monolithic
  tests.py.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

* Restore 66.9% coverage baseline (count tests/ + parsedmarc)

Master's headline 66.9% number on Codecov includes the tests.py file
itself (99.35% covered) being measured alongside parsedmarc/*.  The
original pyproject.toml had no `[tool.coverage.run]` block, so coverage's
default — "measure every file imported during the run" — counted the
test code as if it were product code.

The split commit added `source = ["parsedmarc"]` which suppressed
measurement of the test files (correct in principle, since test files
aren't shipped code), and that alone made the headline number drop by
~8 percentage points without any actual loss of testing.  This commit
swaps `source` for an explicit `include = ["parsedmarc/*", "tests/*"]`
so both halves are measured the way they were on master.  Verified:
276 tests, 66.96% line coverage (effectively unchanged from master's
66.90%).
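
The size of the shift follows from how a combined line-coverage figure
is computed: covered statements are summed over every measured file and
divided by the total. A minimal sketch, using made-up statement counts
chosen only to land near the percentages quoted above (the real counts
are not given in this message):

```python
def combined_coverage(parts):
    """Percent coverage over several (covered, total) statement counts."""
    covered = sum(c for c, _ in parts)
    total = sum(t for _, t in parts)
    return 100.0 * covered / total


# Hypothetical statement counts, not measured values.
package = (5900, 10000)    # 59% of shipped code covered
test_file = (2418, 2434)   # test file itself, a little over 99% covered

shipped_only = combined_coverage([package])
with_tests = combined_coverage([package, test_file])

print(f"shipped code only: {shipped_only:.1f}%")
print(f"with test file:    {with_tests:.1f}%")
```

With these illustrative counts the combined figure sits about eight
points above the shipped-code-only figure, even though no additional
library code is exercised.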

If you want the shipped-code-only number (the headline that this
commit overrides), run `pytest --cov=parsedmarc tests/`.  That number
is currently 59% and is the focus of the upcoming coverage-expansion PR.

Also adds junit.xml to .gitignore so the CI artefact doesn't get
accidentally committed.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

* Restrict coverage to shipped code (`source = ["parsedmarc"]`)

Reverts the prior commit's `include = ["tests/*"]`. Counting the test
files toward coverage was wrong — it conflates "shipped code exercised
by tests" with "test code that pytest auto-runs", inflates the headline
number, and rewards writing more tests rather than tests that verify
more code. Master's apparent 66.9% was an artefact of the old
pyproject.toml having no [tool.coverage.run] block at all; coverage's
default behaviour measured every imported file, including the test file
itself at ~99% "covered", which added ~8 percentage points to the
displayed number without any real testing signal.

Restricting to `source = ["parsedmarc"]` plus the existing maps omit
gives a meaningful baseline: 59% of shipped code is exercised by the
test suite today. That's the number the next PR is targeting to lift
to 90%+ before the 10.0.0 release; the Codecov "drop" here is a
measurement correction, not a regression.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

---------

Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
---
 .github/workflows/python-tests.yml |    2 +-
 .gitignore                         |    2 +
 AGENTS.md                          |    9 +-
 pyproject.toml                     |   23 +
 tests.py                           | 5174 ----------------------------
 tests/__init__.py                  |    0
 tests/test_cli.py                  | 1809 ++++++++++
 tests/test_gelf.py                 |   23 +
 tests/test_init.py                 | 2310 +++++++++++++
 tests/test_kafkaclient.py          |   58 +
 tests/test_loganalytics.py         |   53 +
 tests/test_maps.py                 |  142 +
 tests/test_s3.py                   |   23 +
 tests/test_splunk.py               |   49 +
 tests/test_syslog.py               |   39 +
 tests/test_utils.py                |  722 ++++
 tests/test_webhook.py              |   76 +
 17 files changed, 5336 insertions(+), 5178 deletions(-)
 delete mode 100755 tests.py
 create mode 100644 tests/__init__.py
 create mode 100644 tests/test_cli.py
 create mode 100644 tests/test_gelf.py
 create mode 100644 tests/test_init.py
 create mode 100644 tests/test_kafkaclient.py
 create mode 100644 tests/test_loganalytics.py
 create mode 100644 tests/test_maps.py
 create mode 100644 tests/test_s3.py
 create mode 100644 tests/test_splunk.py
 create mode 100644 tests/test_syslog.py
 create mode 100644 tests/test_utils.py
 create mode 100644 tests/test_webhook.py
diff --git a/.github/workflows/python-tests.yml b/.github/workflows/python-tests.yml
index 200deaf..a617098 100644
--- a/.github/workflows/python-tests.yml
+++ b/.github/workflows/python-tests.yml
@@ -73,7 +73,7 @@ jobs:
         pip install .[build]
     - name: Run unit tests
       run: |
-        pytest --cov --cov-report=xml --junitxml=junit.xml -o junit_family=legacy tests.py
+        python -m pytest --cov --cov-report=xml --junitxml=junit.xml -o junit_family=legacy tests/
     - name: Test sample DMARC reports
       run: |
         pip install -e .
diff --git a/.gitignore b/.gitignore
index efaaddb..155f341 100644
--- a/.gitignore
+++ b/.gitignore
@@ -147,3 +147,5 @@ parsedmarc/resources/maps/unknown_domains.txt
 *.bak
 *.lock
 parsedmarc/resources/maps/domain_info.tsv
+coverage.json
+junit.xml
diff --git a/AGENTS.md b/AGENTS.md
index 0e92c1e..da8f7c7 100644
--- a/AGENTS.md
+++ b/AGENTS.md
@@ -13,10 +13,13 @@ parsedmarc is a Python module and CLI utility for parsing DMARC aggregate (RUA),
 pip install .[build]
 
 # Run all tests with coverage
-pytest --cov --cov-report=xml tests.py
+pytest --cov --cov-report=xml tests/
+
+# Run one test module
+pytest tests/test_init.py
 
 # Run a single test
-pytest tests.py::Test::testAggregateSamples
+pytest tests/test_init.py::Test::testAggregateSamples
 
 # Lint and format
 ruff check .
@@ -107,7 +110,7 @@ IP address info cached for 4 hours, seen aggregate report IDs cached for 1 hour
 - Ruff for formatting and linting (configured in `.vscode/settings.json`). Run `ruff check .` and `ruff format --check .` after every code edit, before committing.
 - TypedDict for structured data, type hints throughout.
 - Python ≥3.10 required.
-- Tests are in a single `tests.py` file using unittest; sample reports live in `samples/`.
+- Tests live under `tests/` as `tests/test_<module>.py`, one per top-level `parsedmarc/*` module (e.g. `tests/test_init.py` for `parsedmarc/__init__.py`, `tests/test_cli.py` for `parsedmarc/cli.py`). All test classes use `unittest`. Sample reports live in `samples/`. Run with `pytest tests/`; run one file with `pytest tests/test_init.py`. New tests go in the file whose module they exercise — do not reintroduce a monolithic test file.
 - File path config values must be wrapped with `_expand_path()` in `cli.py`.
 - Maildir UID checks are intentionally relaxed (warn, don't crash) for Docker compatibility.
 - Token file writes must create parent directories before opening for write.
diff --git a/pyproject.toml b/pyproject.toml
index f297060..2688343 100644
--- a/pyproject.toml
+++ b/pyproject.toml
@@ -96,3 +96,26 @@ exclude = [
     # which must keep shipping for `importlib.resources.files()` lookups).
     "parsedmarc/resources/maps/[!_]*.py",
 ]
+
+[tool.pytest.ini_options]
+# Default to the per-module test layout under tests/. New tests should go
+# into tests/test_<module>.py to match the file they exercise; do not
+# reintroduce a monolithic tests.py.
+testpaths = ["tests"]
+
+[tool.coverage.run]
+# Coverage measures shipped code only. Master's reported ≈66.9% on
+# Codecov was an artefact of the old monolithic tests.py having no
+# [tool.coverage.run] block, which let coverage's default behaviour
+# measure every file imported during the run — including the test file
+# itself at ~99% "covered". That inflated the headline by ~8 percentage
+# points without any actual testing signal. Restricting to the parsedmarc
+# package gives a meaningful number that tracks how much of the shipped
+# library the test suite actually exercises.
+source = ["parsedmarc"]
+# Maintainer-only batch scripts under parsedmarc/resources/maps/ ship
+# out of the wheel (see the [tool.hatch.build] exclude block above) —
+# omit them so the headline number reflects only installed library code.
+omit = [
+    "*/parsedmarc/resources/maps/*.py",
+]
diff --git a/tests.py b/tests.py
deleted file mode 100755
index 1c7c570..0000000
--- a/tests.py
+++ /dev/null
@@ -1,5174 +0,0 @@
-#!/usr/bin/env python3
-# -*- coding: utf-8 -*-
-
-from __future__ import absolute_import, print_function, unicode_literals
-
-import io
-import json
-import os
-import signal
-import sys
-import tempfile
-import unittest
-from configparser import ConfigParser
-from datetime import datetime, timedelta, timezone
-from io import BytesIO
-from glob import glob
-from pathlib import Path
-from tempfile import NamedTemporaryFile
-from typing import BinaryIO, cast
-from types import SimpleNamespace
-from unittest.mock import MagicMock, patch
-
-from expiringdict import ExpiringDict
-from lxml import etree  # type: ignore[import-untyped]
-
-import dns.exception
-import requests
-
-import parsedmarc
-import parsedmarc.cli
-import parsedmarc.webhook
-from parsedmarc.types import AggregateReport, FailureReport, SMTPTLSReport
-import parsedmarc.elastic
-import parsedmarc.opensearch as opensearch_module
-import parsedmarc.utils
-
-# Detect if running in GitHub Actions to skip DNS lookups
-OFFLINE_MODE = os.environ.get("GITHUB_ACTIONS", "false").lower() == "true"
-
-
-def minify_xml(xml_string):
-    parser = etree.XMLParser(remove_blank_text=True)
-    tree = etree.fromstring(xml_string.encode("utf-8"), parser)
-    return etree.tostring(tree, pretty_print=False).decode("utf-8")
-
-
-def compare_xml(xml1, xml2):
-    parser = etree.XMLParser(remove_blank_text=True)
-    tree1 = etree.fromstring(xml1.encode("utf-8"), parser)
-    tree2 = etree.fromstring(xml2.encode("utf-8"), parser)
-    return etree.tostring(tree1) == etree.tostring(tree2)
-
-
-class Test(unittest.TestCase):
-    def testBase64Decoding(self):
-        """Test base64 decoding"""
-        # Example from Wikipedia Base64 article
-        b64_str = "YW55IGNhcm5hbCBwbGVhcw"
-        decoded_str = parsedmarc.utils.decode_base64(b64_str)
-        self.assertEqual(decoded_str, b"any carnal pleas")
-
-    def testPSLDownload(self):
-        """Test Public Suffix List domain lookups"""
-        subdomain = "foo.example.com"
-        result = parsedmarc.utils.get_base_domain(subdomain)
-        self.assertEqual(result, "example.com")
-
-        # psl_overrides.txt intentionally folds CDN-customer PTRs so every
-        # sender on the same network clusters under one display key.
-        # ``.akamaiedge.net`` is an override, so its subdomains collapse to
-        # ``akamaiedge.net`` even though the live PSL carries the finer-grained
-        # ``c.akamaiedge.net`` — the override is the design decision.
-        subdomain = "e3191.c.akamaiedge.net"
-        result = parsedmarc.utils.get_base_domain(subdomain)
-        assert result == "akamaiedge.net"
-
-    def testExtractReportXMLComparator(self):
-        """Test XML comparator function"""
-        with open("samples/extract_report/nice-input.xml") as f:
-            xmlnice = f.read()
-        with open("samples/extract_report/changed-input.xml") as f:
-            xmlchanged = minify_xml(f.read())
-        self.assertTrue(compare_xml(xmlnice, xmlnice))
-        self.assertTrue(compare_xml(xmlchanged, xmlchanged))
-        self.assertFalse(compare_xml(xmlnice, xmlchanged))
-        self.assertFalse(compare_xml(xmlchanged, xmlnice))
-        print("Passed!")
-
-    def testExtractReportBytes(self):
-        """Test extract report function for bytes string input"""
-        print()
-        file = "samples/extract_report/nice-input.xml"
-        with open(file, "rb") as f:
-            data = f.read()
-        print("Testing {0}: ".format(file), end="")
-        xmlout = parsedmarc.extract_report(data)
-        with open("samples/extract_report/nice-input.xml") as f:
-            xmlin = f.read()
-        self.assertTrue(compare_xml(xmlout, xmlin))
-        print("Passed!")
-
-    def testExtractReportXML(self):
-        """Test extract report function for XML input"""
-        print()
-        file = "samples/extract_report/nice-input.xml"
-        print("Testing {0}: ".format(file), end="")
-        xmlout = parsedmarc.extract_report_from_file_path(file)
-        with open("samples/extract_report/nice-input.xml") as f:
-            xmlin = f.read()
-        self.assertTrue(compare_xml(xmlout, xmlin))
-        print("Passed!")
-
-    def testExtractReportXMLFromPath(self):
-        """Test extract report function for pathlib.Path input"""
-        report_path = Path("samples/extract_report/nice-input.xml")
-        xmlout = parsedmarc.extract_report_from_file_path(report_path)
-        with open("samples/extract_report/nice-input.xml") as xmlin_file:
-            xmlin = xmlin_file.read()
-        self.assertTrue(compare_xml(xmlout, xmlin))
-
-    def testExtractReportGZip(self):
-        """Test extract report function for gzip input"""
-        print()
-        file = "samples/extract_report/nice-input.xml.gz"
-        print("Testing {0}: ".format(file), end="")
-        xmlout = parsedmarc.extract_report_from_file_path(file)
-        with open("samples/extract_report/nice-input.xml") as f:
-            xmlin = f.read()
-        self.assertTrue(compare_xml(xmlout, xmlin))
-        print("Passed!")
-
-    def testExtractReportZip(self):
-        """Test extract report function for zip input"""
-        print()
-        file = "samples/extract_report/nice-input.xml.zip"
-        print("Testing {0}: ".format(file), end="")
-        xmlout = parsedmarc.extract_report_from_file_path(file)
-        with open("samples/extract_report/nice-input.xml") as f:
-            xmlin = minify_xml(f.read())
-        self.assertTrue(compare_xml(xmlout, xmlin))
-        with open("samples/extract_report/changed-input.xml") as f:
-            xmlin = f.read()
-        self.assertFalse(compare_xml(xmlout, xmlin))
-        print("Passed!")
-
-    def testParseReportFileAcceptsPathForXML(self):
-        report_path = Path(
-            "samples/aggregate/protection.outlook.com!example.com!1711756800!1711843200.xml"
-        )
-        result = parsedmarc.parse_report_file(
-            report_path,
-            offline=True,
-        )
-        assert result["report_type"] == "aggregate"
-        report = cast(AggregateReport, result["report"])
-        self.assertEqual(report["report_metadata"]["org_name"], "outlook.com")
-
-    def testParseReportFileAcceptsPathForEmail(self):
-        report_path = Path(
-            "samples/aggregate/Report domain- borschow.com Submitter- google.com Report-ID- 949348866075514174.eml"
-        )
-        result = parsedmarc.parse_report_file(
-            report_path,
-            offline=True,
-        )
-        assert result["report_type"] == "aggregate"
-        report = cast(AggregateReport, result["report"])
-        self.assertEqual(report["report_metadata"]["org_name"], "google.com")
-
-    def testAggregateSamples(self):
-        """Test sample aggregate/rua DMARC reports"""
-        print()
-        sample_paths = glob("samples/aggregate/*")
-        for sample_path in sample_paths:
-            if os.path.isdir(sample_path):
-                continue
-            print("Testing {0}: ".format(sample_path), end="")
-            with self.subTest(sample=sample_path):
-                result = parsedmarc.parse_report_file(
-                    sample_path, always_use_local_files=True, offline=OFFLINE_MODE
-                )
-                assert result["report_type"] == "aggregate"
-                parsedmarc.parsed_aggregate_reports_to_csv(
-                    cast(AggregateReport, result["report"])
-                )
-            print("Passed!")
-
-    def testEmptySample(self):
-        """Test empty/unparasable report"""
-        with self.assertRaises(parsedmarc.ParserError):
-            parsedmarc.parse_report_file("samples/empty.xml", offline=OFFLINE_MODE)
-
-    def testFailureSamples(self):
-        """Test sample failure/ruf DMARC reports"""
-        print()
-        sample_paths = glob("samples/failure/*.eml")
-        for sample_path in sample_paths:
-            print("Testing {0}: ".format(sample_path), end="")
-            with self.subTest(sample=sample_path):
-                with open(sample_path) as sample_file:
-                    sample_content = sample_file.read()
-                    email_result = parsedmarc.parse_report_email(
-                        sample_content, offline=OFFLINE_MODE
-                    )
-                    assert email_result["report_type"] == "failure"
-                result = parsedmarc.parse_report_file(sample_path, offline=OFFLINE_MODE)
-                assert result["report_type"] == "failure"
-                parsedmarc.parsed_failure_reports_to_csv(
-                    cast(FailureReport, result["report"])
-                )
-            print("Passed!")
-
-    def testFailureReportBackwardCompat(self):
-        """Test that old forensic function aliases still work"""
-        self.assertIs(
-            parsedmarc.parse_forensic_report,
-            parsedmarc.parse_failure_report,
-        )
-        self.assertIs(
-            parsedmarc.parsed_forensic_reports_to_csv,
-            parsedmarc.parsed_failure_reports_to_csv,
-        )
-        self.assertIs(
-            parsedmarc.parsed_forensic_reports_to_csv_rows,
-            parsedmarc.parsed_failure_reports_to_csv_rows,
-        )
-        self.assertIs(
-            parsedmarc.InvalidForensicReport,
-            parsedmarc.InvalidFailureReport,
-        )
-
-    def testRFC9990SampleReport(self):
-        """Test parsing the sample report from RFC 9990 Appendix B"""
-        print()
-        sample_path = "samples/aggregate/rfc9990-sample.xml"
-        print("Testing {0}: ".format(sample_path), end="")
-        result = parsedmarc.parse_report_file(
-            sample_path, always_use_local_files=True, offline=True
-        )
-        report = cast(AggregateReport, result["report"])
-
-        # Verify report_type
-        self.assertEqual(result["report_type"], "aggregate")
-
-        # Verify xml_schema
-        self.assertEqual(report["xml_schema"], "1.0")
-
-        # Verify report_metadata
-        metadata = report["report_metadata"]
-        self.assertEqual(metadata["org_name"], "Sample Reporter")
-        self.assertEqual(metadata["org_email"], "report_sender@example-reporter.com")
-        self.assertEqual(metadata["org_extra_contact_info"], "...")
-        self.assertEqual(metadata["report_id"], "3v98abbp8ya9n3va8yr8oa3ya")
-        self.assertEqual(
-            metadata["generator"],
-            "Example DMARC Aggregate Reporter v1.2",
-        )
-
-        # Verify RFC 9990 policy_published fields
-        pp = report["policy_published"]
-        self.assertEqual(pp["domain"], "example.com")
-        self.assertEqual(pp["p"], "quarantine")
-        self.assertEqual(pp["sp"], "none")
-        self.assertEqual(pp["np"], "none")
-        self.assertEqual(pp["testing"], "n")
-        self.assertEqual(pp["discovery_method"], "treewalk")
-        # adkim/aspf default when not in XML
-        self.assertEqual(pp["adkim"], "r")
-        self.assertEqual(pp["aspf"], "r")
-        # pct is removed in RFC 9989 (and so absent from the RFC 9990
-        # sample); fo is still part of RFC 9990's PolicyPublishedType but
-        # the appendix sample happens not to set it.
-        self.assertIsNone(pp["pct"])
-        self.assertIsNone(pp["fo"])
-
-        # Verify record
-        self.assertEqual(len(report["records"]), 1)
-        rec = report["records"][0]
-        self.assertEqual(rec["source"]["ip_address"], "192.0.2.123")
-        self.assertEqual(rec["count"], 123)
-        self.assertEqual(rec["policy_evaluated"]["disposition"], "pass")
-        self.assertEqual(rec["policy_evaluated"]["dkim"], "pass")
-        self.assertEqual(rec["policy_evaluated"]["spf"], "fail")
-
-        # Verify DKIM auth result with human_result
-        self.assertEqual(len(rec["auth_results"]["dkim"]), 1)
-        dkim = rec["auth_results"]["dkim"][0]
-        self.assertEqual(dkim["domain"], "example.com")
-        self.assertEqual(dkim["selector"], "abc123")
-        self.assertEqual(dkim["result"], "pass")
-        self.assertIsNone(dkim["human_result"])
-
-        # Verify SPF auth result with human_result
-        self.assertEqual(len(rec["auth_results"]["spf"]), 1)
-        spf = rec["auth_results"]["spf"][0]
-        self.assertEqual(spf["domain"], "example.com")
-        self.assertEqual(spf["result"], "fail")
-        self.assertIsNone(spf["human_result"])
-
-        # Verify CSV output includes new fields
-        csv = parsedmarc.parsed_aggregate_reports_to_csv(report)
-        header = csv.split("\n")[0]
-        self.assertIn("np", header.split(","))
-        self.assertIn("testing", header.split(","))
-        self.assertIn("discovery_method", header.split(","))
-        print("Passed!")
-
-    def testRFC9990FieldsAbsentFromRFC7489Report(self):
-        """Test that RFC 7489 reports have None for RFC 9990-only fields"""
-        print()
-        sample_path = (
-            "samples/aggregate/example.net!example.com!1529366400!1529452799.xml"
-        )
-        print("Testing {0}: ".format(sample_path), end="")
-        result = parsedmarc.parse_report_file(
-            sample_path, always_use_local_files=True, offline=True
-        )
-        report = cast(AggregateReport, result["report"])
-        pp = report["policy_published"]
-
-        # RFC 7489 fields present
-        self.assertEqual(pp["pct"], "100")
-        self.assertEqual(pp["fo"], "0")
-
-        # RFC 9990-only fields absent (None)
-        self.assertIsNone(pp["np"])
-        self.assertIsNone(pp["testing"])
-        self.assertIsNone(pp["discovery_method"])
-
-        # generator absent (None)
-        self.assertIsNone(report["report_metadata"]["generator"])
-        print("Passed!")
-
-    def testRFC9990WithExplicitFields(self):
-        """Test RFC 9990 report with explicit testing and discovery_method"""
-        print()
-        sample_path = (
-            "samples/aggregate/"
-            "rfc9990-example.net!example.com!1700000000!1700086399.xml"
-        )
-        print("Testing {0}: ".format(sample_path), end="")
-        result = parsedmarc.parse_report_file(
-            sample_path, always_use_local_files=True, offline=True
-        )
-        report = cast(AggregateReport, result["report"])
-        pp = report["policy_published"]
-
-        self.assertEqual(pp["np"], "reject")
-        self.assertEqual(pp["testing"], "y")
-        self.assertEqual(pp["discovery_method"], "treewalk")
-        print("Passed!")
-
-    def testRFC9990NamespaceCaptured(self):
-        """The dmarc-2.0 namespace on <feedback> is preserved on the
-        parsed report so consumers can distinguish RFC 9990 from RFC 7489
-        reports without inferring from the version element value."""
-        result = parsedmarc.parse_report_file(
-            "samples/aggregate/rfc9990-sample.xml",
-            always_use_local_files=True,
-            offline=True,
-        )
-        report = cast(AggregateReport, result["report"])
-        self.assertEqual(
-            report["xml_namespace"],
-            "urn:ietf:params:xml:ns:dmarc-2.0",
-        )
-
-    def testRFC9990NamespaceAbsentOnRFC7489Report(self):
-        """RFC 7489 reports don't declare the dmarc-2.0 namespace, so
-        xml_namespace is None."""
-        result = parsedmarc.parse_report_file(
-            "samples/aggregate/example.net!example.com!1529366400!1529452799.xml",
-            always_use_local_files=True,
-            offline=True,
-        )
-        report = cast(AggregateReport, result["report"])
-        self.assertIsNone(report["xml_namespace"])
-
-    def testRFC9990DetectionAcceptsNamespacelessReports(self):
-        """A report that follows the RFC 9990 shape without declaring the
-        namespace (e.g. emits np/testing/discovery_method) is still
-        treated as RFC 9990 for validation purposes — warnings fire,
-        the namespace field reports it honestly as absent."""
-        with self.assertLogs("parsedmarc.log", level="WARNING") as cm:
-            report = parsedmarc.parse_aggregate_report_xml(
-                """<?xml version="1.0"?>
-                <feedback>
-                    <report_metadata>
-                        <org_name>Test</org_name>
-                        <email>t@example.com</email>
-                        <report_id>r1</report_id>
-                        <date_range><begin>1700000000</begin><end>1700086399</end></date_range>
-                    </report_metadata>
-                    <policy_published>
-                        <domain>example.com</domain>
-                        <p>none</p>
-                        <np>reject</np>
-                    </policy_published>
-                    <record>
-                        <row>
-                            <source_ip>192.0.2.1</source_ip>
-                            <count>1</count>
-                            <policy_evaluated>
-                                <disposition>none</disposition>
-                                <dkim>pass</dkim>
-                                <spf>pass</spf>
-                            </policy_evaluated>
-                        </row>
-                        <identifiers><header_from>example.com</header_from></identifiers>
-                        <auth_results>
-                            <dkim>
-                                <domain>example.com</domain>
-                                <result>pass</result>
-                            </dkim>
-                        </auth_results>
-                    </record>
-                </feedback>""",
-                offline=True,
-            )
-        # Namespace honestly None because none was declared.
-        self.assertIsNone(report["xml_namespace"])
-        # RFC 9990 detection still fired (DKIM selector warning emitted).
-        self.assertTrue(
-            any("selector" in msg for msg in cm.output),
-            f"Expected DKIM selector warning; got: {cm.output}",
-        )
-
-    def testRFC9990DKIMMissingSelectorWarning(self):
-        """A DKIM auth result with no <selector> in an RFC 9990 report
-        (namespace declared) emits a warning since selector is REQUIRED."""
-        xml = """<?xml version="1.0"?>
-        <feedback xmlns="urn:ietf:params:xml:ns:dmarc-2.0">
-            <version>1.0</version>
-            <report_metadata>
-                <org_name>Test</org_name>
-                <email>t@example.com</email>
-                <report_id>r1</report_id>
-                <date_range><begin>1700000000</begin><end>1700086399</end></date_range>
-            </report_metadata>
-            <policy_published>
-                <domain>example.com</domain>
-                <p>none</p>
-            </policy_published>
-            <record>
-                <row>
-                    <source_ip>192.0.2.1</source_ip>
-                    <count>1</count>
-                    <policy_evaluated>
-                        <disposition>none</disposition>
-                        <dkim>pass</dkim>
-                        <spf>pass</spf>
-                    </policy_evaluated>
-                </row>
-                <identifiers><header_from>example.com</header_from></identifiers>
-                <auth_results>
-                    <dkim>
-                        <domain>example.com</domain>
-                        <result>pass</result>
-                    </dkim>
-                </auth_results>
-            </record>
-        </feedback>"""
-        with self.assertLogs("parsedmarc.log", level="WARNING") as cm:
-            parsedmarc.parse_aggregate_report_xml(xml, offline=True)
-        self.assertTrue(
-            any("selector" in m and "REQUIRED" in m for m in cm.output),
-            f"Expected selector REQUIRED warning; got: {cm.output}",
-        )
-
-    def testRFC9990LegacyOverrideTypeWarning(self):
-        """`forwarded` and `sampled_out` were removed in RFC 9990;
-        a warning fires when they appear in an RFC 9990 report."""
-        xml = """<?xml version="1.0"?>
-        <feedback xmlns="urn:ietf:params:xml:ns:dmarc-2.0">
-            <report_metadata>
-                <org_name>Test</org_name>
-                <email>t@example.com</email>
-                <report_id>r1</report_id>
-                <date_range><begin>1700000000</begin><end>1700086399</end></date_range>
-            </report_metadata>
-            <policy_published>
-                <domain>example.com</domain>
-                <p>none</p>
-            </policy_published>
-            <record>
-                <row>
-                    <source_ip>192.0.2.1</source_ip>
-                    <count>1</count>
-                    <policy_evaluated>
-                        <disposition>none</disposition>
-                        <dkim>pass</dkim>
-                        <spf>pass</spf>
-                        <reason><type>forwarded</type></reason>
-                    </policy_evaluated>
-                </row>
-                <identifiers><header_from>example.com</header_from></identifiers>
-                <auth_results>
-                    <dkim>
-                        <domain>example.com</domain>
-                        <selector>s</selector>
-                        <result>pass</result>
-                    </dkim>
-                </auth_results>
-            </record>
-        </feedback>"""
-        with self.assertLogs("parsedmarc.log", level="WARNING") as cm:
-            parsedmarc.parse_aggregate_report_xml(xml, offline=True)
-        self.assertTrue(
-            any("forwarded" in m and "removed in RFC 9990" in m for m in cm.output),
-            f"Expected legacy override warning; got: {cm.output}",
-        )
-
-    def testRFC9990LangAttrStringUnwrapped(self):
-        """When a langAttrString element (extra_contact_info, error,
-        comment, human_result) carries a lang attribute, xmltodict turns
-        it into {"#text": "...", "@lang": "en"}; the parser must unwrap
-        to the text payload so the report stays comparable to one
-        without the lang attribute."""
-        xml = """<?xml version="1.0"?>
-        <feedback xmlns="urn:ietf:params:xml:ns:dmarc-2.0">
-            <report_metadata>
-                <org_name>Test</org_name>
-                <email>t@example.com</email>
-                <extra_contact_info xml:lang="en">contact-here</extra_contact_info>
-                <report_id>r1</report_id>
-                <date_range><begin>1700000000</begin><end>1700086399</end></date_range>
-                <error xml:lang="en">a problem</error>
-            </report_metadata>
-            <policy_published>
-                <domain>example.com</domain>
-                <p>none</p>
-            </policy_published>
-            <record>
-                <row>
-                    <source_ip>192.0.2.1</source_ip>
-                    <count>1</count>
-                    <policy_evaluated>
-                        <disposition>none</disposition>
-                        <dkim>pass</dkim>
-                        <spf>pass</spf>
-                        <reason>
-                            <type>local_policy</type>
-                            <comment xml:lang="en">a comment</comment>
-                        </reason>
-                    </policy_evaluated>
-                </row>
-                <identifiers><header_from>example.com</header_from></identifiers>
-                <auth_results>
-                    <dkim>
-                        <domain>example.com</domain>
-                        <selector>s</selector>
-                        <result>pass</result>
-                        <human_result xml:lang="en">looks fine</human_result>
-                    </dkim>
-                    <spf>
-                        <domain>example.com</domain>
-                        <result>pass</result>
-                        <human_result xml:lang="en">spf-detail</human_result>
-                    </spf>
-                </auth_results>
-            </record>
-        </feedback>"""
-        report = parsedmarc.parse_aggregate_report_xml(xml, offline=True)
-        self.assertEqual(
-            report["report_metadata"]["org_extra_contact_info"], "contact-here"
-        )
-        self.assertEqual(report["report_metadata"]["errors"], ["a problem"])
-        rec = report["records"][0]
-        reasons = rec["policy_evaluated"]["policy_override_reasons"]
-        self.assertEqual(reasons[0]["comment"], "a comment")
-        self.assertEqual(rec["auth_results"]["dkim"][0]["human_result"], "looks fine")
-        self.assertEqual(rec["auth_results"]["spf"][0]["human_result"], "spf-detail")
-
-    def testSmtpTlsSamples(self):
-        """Test sample SMTP TLS reports"""
-        print()
-        sample_paths = glob("samples/smtp_tls/*")
-        for sample_path in sample_paths:
-            if os.path.isdir(sample_path):
-                continue
-            print("Testing {0}: ".format(sample_path), end="")
-            with self.subTest(sample=sample_path):
-                result = parsedmarc.parse_report_file(sample_path, offline=OFFLINE_MODE)
-                assert result["report_type"] == "smtp_tls"
-                parsedmarc.parsed_smtp_tls_reports_to_csv(
-                    cast(SMTPTLSReport, result["report"])
-                )
-            print("Passed!")
-
-    def testIpAddressInfoSurfacesASNFields(self):
-        """ASN number, name, and domain from the bundled MMDB appear on every
-        IP info result, even when no PTR resolves."""
-        info = parsedmarc.utils.get_ip_address_info("8.8.8.8", offline=True)
-        self.assertEqual(info["asn"], 15169)
-        self.assertIsInstance(info["asn"], int)
-        self.assertEqual(info["as_domain"], "google.com")
-        self.assertTrue(info["as_name"])
-
-    def testIpAddressInfoFallsBackToASNMapEntryWhenNoPTR(self):
-        """When reverse DNS is absent, the ASN domain should be used as a
-        lookup into the reverse_dns_map so the row still gets attributed,
-        while reverse_dns and base_domain remain null."""
-        info = parsedmarc.utils.get_ip_address_info("8.8.8.8", offline=True)
-        self.assertIsNone(info["reverse_dns"])
-        self.assertIsNone(info["base_domain"])
-        self.assertEqual(info["name"], "Google (Including Gmail and Google Workspace)")
-        self.assertEqual(info["type"], "Email Provider")
-
-    def testIpAddressInfoFallsBackToRawASNameOnMapMiss(self):
-        """When neither PTR nor an ASN-map entry resolves, the raw AS name
-        is used as source_name with type left null — better than leaving
-        the row unattributed."""
-        # 204.79.197.100 is in an ASN whose as_domain is not in the map at
-        # the time of this test (msn.com); this exercises the as_name
-        # fallback branch without depending on a specific map state.
-        from unittest.mock import patch
-
-        with patch(
-            "parsedmarc.utils.get_ip_address_db_record",
-            return_value={
-                "country": "US",
-                "asn": 64496,
-                "as_name": "Some Unmapped Org, Inc.",
-                "as_domain": "unmapped-for-this-test.example",
-            },
-        ):
-            # Bypass cache to avoid prior-test pollution.
-            info = parsedmarc.utils.get_ip_address_info(
-                "192.0.2.1", offline=True, cache=None
-            )
-        self.assertIsNone(info["reverse_dns"])
-        self.assertIsNone(info["base_domain"])
-        self.assertIsNone(info["type"])
-        self.assertEqual(info["name"], "Some Unmapped Org, Inc.")
-        self.assertEqual(info["as_domain"], "unmapped-for-this-test.example")
-
-    def testWeakFallbackAttributionIsNotCached(self):
-        """A transient PTR lookup failure that lands on the raw-as_name
-        fallback must not poison the cache. ``get_reverse_dns()`` swallows
-        every DNSException as ``None``, so a timeout looks identical to a
-        real no-PTR case — if we cached the weak attribution, the 4-hour
-        TTL would lock in a misattribution even after the PTR returns.
-
-        PTR-backed matches and ASN-domain matches are stable attributions
-        and must still be cached, so we only skip the specific
-        ``reverse_dns=None AND type=None AND name=as_name`` state."""
-        from unittest.mock import patch
-        from expiringdict import ExpiringDict
-
-        cache = ExpiringDict(max_len=100, max_age_seconds=14400)
-
-        # Scenario 1: weak fallback (no PTR, unmapped as_domain, raw as_name
-        # used). Must NOT be cached.
-        with patch(
-            "parsedmarc.utils.get_ip_address_db_record",
-            return_value={
-                "country": "US",
-                "asn": 64496,
-                "as_name": "Some Unmapped Org, Inc.",
-                "as_domain": "unmapped-for-this-test.example",
-            },
-        ):
-            parsedmarc.utils.get_ip_address_info("192.0.2.1", offline=True, cache=cache)
-        self.assertNotIn("192.0.2.1", cache)
-
-        # Scenario 2: ASN-domain match (no PTR, as_domain IS in the map).
-        # Stable attribution — must still be cached.
-        with patch(
-            "parsedmarc.utils.get_ip_address_db_record",
-            return_value={
-                "country": "US",
-                "asn": 15169,
-                "as_name": "Google LLC",
-                "as_domain": "google.com",
-            },
-        ):
-            parsedmarc.utils.get_ip_address_info("192.0.2.2", offline=True, cache=cache)
-        self.assertIn("192.0.2.2", cache)
-
-    def testIPinfoAPIPrimarySourceAndInvalidKeyIsFatal(self):
-        """With an API token configured, lookups hit the API first via the
-        documented ?token= query param. A 401/403 response propagates as
-        ``InvalidIPinfoAPIKey`` so the CLI can exit fatally. Any other
-        non-2xx or network error falls through to the MMDB silently.
-
-        The IPinfo Lite API is documented as having no request limit, so
-        there is no rate-limit/quota handling to test — only the fatal path
-        on invalid tokens and the success path."""
-        from unittest.mock import patch, MagicMock
-
-        from parsedmarc.utils import (
-            InvalidIPinfoAPIKey,
-            configure_ipinfo_api,
-            get_ip_address_db_record,
-        )
-
-        def _mock_response(status_code, json_body=None):
-            resp = MagicMock()
-            resp.status_code = status_code
-            resp.ok = 200 <= status_code < 300
-            resp.json.return_value = json_body or {}
-            return resp
-
-        try:
-            # Success: API returns IPinfo-schema JSON; record comes from API.
-            api_json = {
-                "ip": "8.8.8.8",
-                "asn": "AS15169",
-                "as_name": "Google LLC",
-                "as_domain": "google.com",
-                "country_code": "US",
-            }
-            with patch(
-                "parsedmarc.utils.requests.get",
-                return_value=_mock_response(200, api_json),
-            ) as mock_get:
-                configure_ipinfo_api("fake-token", probe=False)
-                record = get_ip_address_db_record("8.8.8.8")
-            self.assertEqual(record["country"], "US")
-            self.assertEqual(record["asn"], 15169)
-            self.assertEqual(record["as_domain"], "google.com")
-            # Auth must use the documented query param, not a Bearer header.
-            _, kwargs = mock_get.call_args
-            self.assertEqual(kwargs["params"], {"token": "fake-token"})
-            self.assertNotIn("Authorization", kwargs["headers"])
-
-            # Invalid key: 401 raises a fatal exception even on a random lookup.
-            with patch(
-                "parsedmarc.utils.requests.get",
-                return_value=_mock_response(401),
-            ):
-                configure_ipinfo_api("bad-token", probe=False)
-                with self.assertRaises(InvalidIPinfoAPIKey):
-                    get_ip_address_db_record("8.8.8.8")
-
-            # Any other non-2xx (e.g. 500, 503) falls back to the MMDB silently.
-            configure_ipinfo_api("fake-token", probe=False)
-            with patch(
-                "parsedmarc.utils.requests.get",
-                return_value=_mock_response(500),
-            ):
-                record = get_ip_address_db_record("8.8.8.8")
-            # MMDB fallback fills in Google's ASN from the bundled MMDB.
-            self.assertEqual(record["asn"], 15169)
-        finally:
-            configure_ipinfo_api(None)
-
-    def testAggregateCsvExposesASNColumns(self):
-        """The aggregate CSV output should include source_asn, source_as_name,
-        and source_as_domain columns."""
-        result = parsedmarc.parse_report_file(
-            "samples/aggregate/!example.com!1538204542!1538463818.xml",
-            always_use_local_files=True,
-            offline=True,
-        )
-        csv_text = parsedmarc.parsed_aggregate_reports_to_csv(result["report"])
-        header = csv_text.splitlines()[0].split(",")
-        self.assertIn("source_asn", header)
-        self.assertIn("source_as_name", header)
-        self.assertIn("source_as_domain", header)
-
-    def testOpenSearchSigV4RequiresRegion(self):
-        with self.assertRaises(opensearch_module.OpenSearchError):
-            opensearch_module.set_hosts(
-                "https://example.org:9200",
-                auth_type="awssigv4",
-            )
-
-    def testOpenSearchSigV4ConfiguresConnectionClass(self):
-        fake_credentials = object()
-        with patch.object(opensearch_module.boto3, "Session") as session_cls:
-            session_cls.return_value.get_credentials.return_value = fake_credentials
-            with patch.object(
-                opensearch_module, "AWSV4SignerAuth", return_value="auth"
-            ) as signer:
-                with patch.object(
-                    opensearch_module.connections, "create_connection"
-                ) as create_connection:
-                    opensearch_module.set_hosts(
-                        "https://example.org:9200",
-                        use_ssl=True,
-                        auth_type="awssigv4",
-                        aws_region="eu-west-1",
-                    )
-        signer.assert_called_once_with(fake_credentials, "eu-west-1", "es")
-        create_connection.assert_called_once()
-        self.assertEqual(
-            create_connection.call_args.kwargs.get("connection_class"),
-            opensearch_module.RequestsHttpConnection,
-        )
-        self.assertEqual(create_connection.call_args.kwargs.get("http_auth"), "auth")
-
-    def testOpenSearchSigV4RejectsUnknownAuthType(self):
-        with self.assertRaises(opensearch_module.OpenSearchError):
-            opensearch_module.set_hosts(
-                "https://example.org:9200",
-                auth_type="kerberos",
-            )
-
-    def testOpenSearchSigV4RequiresAwsCredentials(self):
-        with patch.object(opensearch_module.boto3, "Session") as session_cls:
-            session_cls.return_value.get_credentials.return_value = None
-            with self.assertRaises(opensearch_module.OpenSearchError):
-                opensearch_module.set_hosts(
-                    "https://example.org:9200",
-                    auth_type="awssigv4",
-                    aws_region="eu-west-1",
-                )
-
-    @patch("parsedmarc.cli.opensearch.migrate_indexes")
-    @patch("parsedmarc.cli.opensearch.set_hosts")
-    @patch("parsedmarc.cli.get_dmarc_reports_from_mailbox")
-    @patch("parsedmarc.cli.IMAPConnection")
-    def testCliPassesOpenSearchSigV4Settings(
-        self,
-        mock_imap_connection,
-        mock_get_reports,
-        mock_set_hosts,
-        _mock_migrate_indexes,
-    ):
-        mock_imap_connection.return_value = object()
-        mock_get_reports.return_value = {
-            "aggregate_reports": [],
-            "failure_reports": [],
-            "smtp_tls_reports": [],
-        }
-
-        config = """[general]
-save_aggregate = true
-silent = true
-
-[imap]
-host = imap.example.com
-user = test-user
-password = test-password
-
-[opensearch]
-hosts = localhost
-authentication_type = awssigv4
-aws_region = eu-west-1
-aws_service = aoss
-"""
-        with tempfile.NamedTemporaryFile(
-            "w", suffix=".ini", delete=False
-        ) as config_file:
-            config_file.write(config)
-            config_path = config_file.name
-        self.addCleanup(lambda: os.path.exists(config_path) and os.remove(config_path))
-
-        with patch.object(sys, "argv", ["parsedmarc", "-c", config_path]):
-            parsedmarc.cli._main()
-
-        self.assertEqual(mock_set_hosts.call_args.kwargs.get("auth_type"), "awssigv4")
-        self.assertEqual(mock_set_hosts.call_args.kwargs.get("aws_region"), "eu-west-1")
-        self.assertEqual(mock_set_hosts.call_args.kwargs.get("aws_service"), "aoss")
-
-    @patch("parsedmarc.cli.elastic.save_aggregate_report_to_elasticsearch")
-    @patch("parsedmarc.cli.elastic.migrate_indexes")
-    @patch("parsedmarc.cli.elastic.set_hosts")
-    @patch("parsedmarc.cli.get_dmarc_reports_from_mailbox")
-    @patch("parsedmarc.cli.IMAPConnection")
-    def testFailOnOutputErrorExits(
-        self,
-        mock_imap_connection,
-        mock_get_reports,
-        _mock_set_hosts,
-        _mock_migrate_indexes,
-        mock_save_aggregate,
-    ):
-        """CLI should exit with code 1 when fail_on_output_error is enabled"""
-        mock_imap_connection.return_value = object()
-        mock_get_reports.return_value = {
-            "aggregate_reports": [{"policy_published": {"domain": "example.com"}}],
-            "failure_reports": [],
-            "smtp_tls_reports": [],
-        }
-        mock_save_aggregate.side_effect = parsedmarc.elastic.ElasticsearchError(
-            "simulated output failure"
-        )
-
-        config = """[general]
-save_aggregate = true
-fail_on_output_error = true
-silent = true
-
-[imap]
-host = imap.example.com
-user = test-user
-password = test-password
-
-[elasticsearch]
-hosts = localhost
-"""
-        with tempfile.NamedTemporaryFile(
-            "w", suffix=".ini", delete=False
-        ) as config_file:
-            config_file.write(config)
-            config_path = config_file.name
-        self.addCleanup(lambda: os.path.exists(config_path) and os.remove(config_path))
-
-        with patch.object(sys, "argv", ["parsedmarc", "-c", config_path]):
-            with self.assertRaises(SystemExit) as ctx:
-                parsedmarc.cli._main()
-
-        self.assertEqual(ctx.exception.code, 1)
-        mock_save_aggregate.assert_called_once()
-
-    @patch("parsedmarc.cli.elastic.save_aggregate_report_to_elasticsearch")
-    @patch("parsedmarc.cli.elastic.migrate_indexes")
-    @patch("parsedmarc.cli.elastic.set_hosts")
-    @patch("parsedmarc.cli.get_dmarc_reports_from_mailbox")
-    @patch("parsedmarc.cli.IMAPConnection")
-    def testOutputErrorDoesNotExitWhenDisabled(
-        self,
-        mock_imap_connection,
-        mock_get_reports,
-        _mock_set_hosts,
-        _mock_migrate_indexes,
-        mock_save_aggregate,
-    ):
-        mock_imap_connection.return_value = object()
-        mock_get_reports.return_value = {
-            "aggregate_reports": [{"policy_published": {"domain": "example.com"}}],
-            "failure_reports": [],
-            "smtp_tls_reports": [],
-        }
-        mock_save_aggregate.side_effect = parsedmarc.elastic.ElasticsearchError(
-            "simulated output failure"
-        )
-
-        config = """[general]
-save_aggregate = true
-fail_on_output_error = false
-silent = true
-
-[imap]
-host = imap.example.com
-user = test-user
-password = test-password
-
-[elasticsearch]
-hosts = localhost
-"""
-        with tempfile.NamedTemporaryFile(
-            "w", suffix=".ini", delete=False
-        ) as config_file:
-            config_file.write(config)
-            config_path = config_file.name
-        self.addCleanup(lambda: os.path.exists(config_path) and os.remove(config_path))
-
-        with patch.object(sys, "argv", ["parsedmarc", "-c", config_path]):
-            parsedmarc.cli._main()
-
-        mock_save_aggregate.assert_called_once()
-
-    @patch("parsedmarc.cli.opensearch.save_failure_report_to_opensearch")
-    @patch("parsedmarc.cli.opensearch.migrate_indexes")
-    @patch("parsedmarc.cli.opensearch.set_hosts")
-    @patch("parsedmarc.cli.elastic.save_failure_report_to_elasticsearch")
-    @patch("parsedmarc.cli.elastic.save_aggregate_report_to_elasticsearch")
-    @patch("parsedmarc.cli.elastic.migrate_indexes")
-    @patch("parsedmarc.cli.elastic.set_hosts")
-    @patch("parsedmarc.cli.get_dmarc_reports_from_mailbox")
-    @patch("parsedmarc.cli.IMAPConnection")
-    def testFailOnOutputErrorExitsWithMultipleSinkErrors(
-        self,
-        mock_imap_connection,
-        mock_get_reports,
-        _mock_es_set_hosts,
-        _mock_es_migrate,
-        mock_save_aggregate,
-        _mock_save_failure_elastic,
-        _mock_os_set_hosts,
-        _mock_os_migrate,
-        mock_save_failure_opensearch,
-    ):
-        mock_imap_connection.return_value = object()
-        mock_get_reports.return_value = {
-            "aggregate_reports": [{"policy_published": {"domain": "example.com"}}],
-            "failure_reports": [{"reported_domain": "example.com"}],
-            "smtp_tls_reports": [],
-        }
-        mock_save_aggregate.side_effect = parsedmarc.elastic.ElasticsearchError(
-            "aggregate sink failed"
-        )
-        mock_save_failure_opensearch.side_effect = (
-            parsedmarc.cli.opensearch.OpenSearchError("failure sink failed")
-        )
-
-        config = """[general]
-save_aggregate = true
-save_failure = true
-fail_on_output_error = true
-silent = true
-
-[imap]
-host = imap.example.com
-user = test-user
-password = test-password
-
-[elasticsearch]
-hosts = localhost
-
-[opensearch]
-hosts = localhost
-"""
-        with tempfile.NamedTemporaryFile(
-            "w", suffix=".ini", delete=False
-        ) as config_file:
-            config_file.write(config)
-            config_path = config_file.name
-        self.addCleanup(lambda: os.path.exists(config_path) and os.remove(config_path))
-
-        with patch.object(sys, "argv", ["parsedmarc", "-c", config_path]):
-            with self.assertRaises(SystemExit) as ctx:
-                parsedmarc.cli._main()
-
-        self.assertEqual(ctx.exception.code, 1)
-        mock_save_aggregate.assert_called_once()
-        mock_save_failure_opensearch.assert_called_once()
-
-
-class _BreakLoop(BaseException):
-    pass
-
-
-class TestGmailAuthModes(unittest.TestCase):
-    @patch("parsedmarc.cli.get_dmarc_reports_from_mailbox")
-    @patch("parsedmarc.cli.GmailConnection")
-    def testCliPassesGmailServiceAccountAuthSettings(
-        self, mock_gmail_connection, mock_get_mailbox_reports
-    ):
-        mock_gmail_connection.return_value = MagicMock()
-        mock_get_mailbox_reports.return_value = {
-            "aggregate_reports": [],
-            "failure_reports": [],
-            "smtp_tls_reports": [],
-        }
-        config = """[general]
-silent = true
-
-[gmail_api]
-credentials_file = /tmp/service-account.json
-auth_mode = service_account
-service_account_user = dmarc@example.com
-scopes = https://www.googleapis.com/auth/gmail.modify
-"""
-        with tempfile.NamedTemporaryFile("w", suffix=".ini", delete=False) as cfg_file:
-            cfg_file.write(config)
-            config_path = cfg_file.name
-        self.addCleanup(lambda: os.path.exists(config_path) and os.remove(config_path))
-
-        with patch.object(sys, "argv", ["parsedmarc", "-c", config_path]):
-            parsedmarc.cli._main()
-
-        self.assertEqual(
-            mock_gmail_connection.call_args.kwargs.get("auth_mode"), "service_account"
-        )
-        self.assertEqual(
-            mock_gmail_connection.call_args.kwargs.get("service_account_user"),
-            "dmarc@example.com",
-        )
-
-    @patch("parsedmarc.cli.get_dmarc_reports_from_mailbox")
-    @patch("parsedmarc.cli.GmailConnection")
-    def testCliAcceptsDelegatedUserAlias(self, mock_gmail_connection, mock_get_reports):
-        mock_gmail_connection.return_value = MagicMock()
-        mock_get_reports.return_value = {
-            "aggregate_reports": [],
-            "failure_reports": [],
-            "smtp_tls_reports": [],
-        }
-        config = """[general]
-silent = true
-
-[gmail_api]
-credentials_file = /tmp/service-account.json
-auth_mode = service_account
-delegated_user = delegated@example.com
-scopes = https://www.googleapis.com/auth/gmail.modify
-"""
-        with tempfile.NamedTemporaryFile("w", suffix=".ini", delete=False) as cfg_file:
-            cfg_file.write(config)
-            config_path = cfg_file.name
-        self.addCleanup(lambda: os.path.exists(config_path) and os.remove(config_path))
-
-        with patch.object(sys, "argv", ["parsedmarc", "-c", config_path]):
-            parsedmarc.cli._main()
-
-        self.assertEqual(
-            mock_gmail_connection.call_args.kwargs.get("service_account_user"),
-            "delegated@example.com",
-        )
-
-
-class TestMailboxWatchSince(unittest.TestCase):
-    def setUp(self):
-        from parsedmarc.log import logger as _logger
-
-        _logger.disabled = True
-        self._stdout_patch = patch("sys.stdout", new_callable=io.StringIO)
-        self._stderr_patch = patch("sys.stderr", new_callable=io.StringIO)
-        self._stdout_patch.start()
-        self._stderr_patch.start()
-
-    def tearDown(self):
-        from parsedmarc.log import logger as _logger
-
-        _logger.disabled = False
-        self._stderr_patch.stop()
-        self._stdout_patch.stop()
-
-    def testWatchInboxPassesSinceToMailboxFetch(self):
-        mailbox_connection = SimpleNamespace()
-
-        def fake_watch(check_callback, check_timeout, config_reloading=None):
-            check_callback(mailbox_connection)
-            raise _BreakLoop()
-
-        mailbox_connection.watch = fake_watch
-        callback = MagicMock()
-        with patch.object(
-            parsedmarc, "get_dmarc_reports_from_mailbox", return_value={}
-        ) as mocked:
-            with self.assertRaises(_BreakLoop):
-                parsedmarc.watch_inbox(
-                    mailbox_connection=cast(
-                        parsedmarc.MailboxConnection, mailbox_connection
-                    ),
-                    callback=callback,
-                    check_timeout=1,
-                    batch_size=10,
-                    since="1d",
-                )
-        self.assertEqual(mocked.call_args.kwargs.get("since"), "1d")
-
-    @patch("parsedmarc.cli.get_dmarc_reports_from_mailbox")
-    @patch("parsedmarc.cli.watch_inbox")
-    @patch("parsedmarc.cli.IMAPConnection")
-    def testCliPassesSinceToWatchInbox(
-        self, mock_imap_connection, mock_watch_inbox, mock_get_mailbox_reports
-    ):
-        mock_imap_connection.return_value = object()
-        mock_get_mailbox_reports.return_value = {
-            "aggregate_reports": [],
-            "failure_reports": [],
-            "smtp_tls_reports": [],
-        }
-        mock_watch_inbox.side_effect = FileExistsError("stop-watch-loop")
-
-        config_text = """[general]
-silent = true
-
-[imap]
-host = imap.example.com
-user = user
-password = pass
-
-[mailbox]
-watch = true
-since = 2d
-"""
-
-        with tempfile.NamedTemporaryFile("w", suffix=".ini", delete=False) as cfg:
-            cfg.write(config_text)
-            cfg_path = cfg.name
-        self.addCleanup(lambda: os.path.exists(cfg_path) and os.remove(cfg_path))
-
-        with patch.object(sys, "argv", ["parsedmarc", "-c", cfg_path]):
-            with self.assertRaises(SystemExit) as system_exit:
-                parsedmarc.cli._main()
-
-        self.assertEqual(system_exit.exception.code, 1)
-        self.assertEqual(mock_watch_inbox.call_args.kwargs.get("since"), "2d")
-
-
-class _DummyMailboxConnection(parsedmarc.MailboxConnection):
-    def __init__(self):
-        self.fetch_calls: list[dict[str, object]] = []
-
-    def create_folder(self, folder_name: str):
-        return None
-
-    def fetch_messages(self, reports_folder: str, **kwargs):
-        self.fetch_calls.append({"reports_folder": reports_folder, **kwargs})
-        return []
-
-    def fetch_message(self, message_id) -> str:
-        return ""
-
-    def delete_message(self, message_id):
-        return None
-
-    def move_message(self, message_id, folder_name: str):
-        return None
-
-    def keepalive(self):
-        return None
-
-    def watch(self, check_callback, check_timeout, config_reloading=None):
-        return None
-
-
-class TestMailboxPerformance(unittest.TestCase):
-    def setUp(self):
-        from parsedmarc.log import logger as _logger
-
-        _logger.disabled = True
-        self._stdout_patch = patch("sys.stdout", new_callable=io.StringIO)
-        self._stderr_patch = patch("sys.stderr", new_callable=io.StringIO)
-        self._stdout_patch.start()
-        self._stderr_patch.start()
-
-    def tearDown(self):
-        from parsedmarc.log import logger as _logger
-
-        _logger.disabled = False
-        self._stderr_patch.stop()
-        self._stdout_patch.stop()
-
-    def testBatchModeAvoidsExtraFullFetch(self):
-        connection = _DummyMailboxConnection()
-        parsedmarc.get_dmarc_reports_from_mailbox(
-            connection=connection,
-            reports_folder="INBOX",
-            test=True,
-            batch_size=10,
-            create_folders=False,
-        )
-        self.assertEqual(len(connection.fetch_calls), 1)
-
-    @patch("parsedmarc.cli.get_dmarc_reports_from_mailbox")
-    @patch("parsedmarc.cli.MSGraphConnection")
-    def testCliPassesMsGraphCertificateAuthSettings(
-        self, mock_graph_connection, mock_get_mailbox_reports
-    ):
-        mock_graph_connection.return_value = object()
-        mock_get_mailbox_reports.return_value = {
-            "aggregate_reports": [],
-            "failure_reports": [],
-            "smtp_tls_reports": [],
-        }
-
-        config_text = """[general]
-silent = true
-
-[msgraph]
-auth_method = Certificate
-client_id = client-id
-tenant_id = tenant-id
-mailbox = shared@example.com
-certificate_path = /tmp/msgraph-cert.pem
-certificate_password = cert-pass
-"""
-
-        with tempfile.NamedTemporaryFile("w", suffix=".ini", delete=False) as cfg:
-            cfg.write(config_text)
-            cfg_path = cfg.name
-        self.addCleanup(lambda: os.path.exists(cfg_path) and os.remove(cfg_path))
-
-        with patch.object(sys, "argv", ["parsedmarc", "-c", cfg_path]):
-            parsedmarc.cli._main()
-
-        self.assertEqual(
-            mock_graph_connection.call_args.kwargs.get("auth_method"), "Certificate"
-        )
-        self.assertEqual(
-            mock_graph_connection.call_args.kwargs.get("certificate_path"),
-            "/tmp/msgraph-cert.pem",
-        )
-        self.assertEqual(
-            mock_graph_connection.call_args.kwargs.get("certificate_password"),
-            "cert-pass",
-        )
-
-    @patch("parsedmarc.cli.get_dmarc_reports_from_mailbox")
-    @patch("parsedmarc.cli.MSGraphConnection")
-    @patch("parsedmarc.cli.logger")
-    def testCliRequiresMsGraphCertificatePath(
-        self, mock_logger, mock_graph_connection, mock_get_mailbox_reports
-    ):
-        config_text = """[general]
-silent = true
-
-[msgraph]
-auth_method = Certificate
-client_id = client-id
-tenant_id = tenant-id
-mailbox = shared@example.com
-"""
-
-        with tempfile.NamedTemporaryFile("w", suffix=".ini", delete=False) as cfg:
-            cfg.write(config_text)
-            cfg_path = cfg.name
-        self.addCleanup(lambda: os.path.exists(cfg_path) and os.remove(cfg_path))
-
-        with patch.object(sys, "argv", ["parsedmarc", "-c", cfg_path]):
-            with self.assertRaises(SystemExit) as system_exit:
-                parsedmarc.cli._main()
-
-        self.assertEqual(system_exit.exception.code, -1)
-        mock_logger.critical.assert_called_once_with(
-            "certificate_path setting missing from the msgraph config section"
-        )
-        mock_graph_connection.assert_not_called()
-        mock_get_mailbox_reports.assert_not_called()
-
-    @patch("parsedmarc.cli.get_dmarc_reports_from_mailbox")
-    @patch("parsedmarc.cli.MSGraphConnection")
-    def testCliUsesMsGraphUserAsMailboxForUsernamePasswordAuth(
-        self, mock_graph_connection, mock_get_mailbox_reports
-    ):
-        mock_graph_connection.return_value = object()
-        mock_get_mailbox_reports.return_value = {
-            "aggregate_reports": [],
-            "failure_reports": [],
-            "smtp_tls_reports": [],
-        }
-
-        config_text = """[general]
-silent = true
-
-[msgraph]
-auth_method = UsernamePassword
-client_id = client-id
-client_secret = client-secret
-user = owner@example.com
-password = test-password
-"""
-
-        with tempfile.NamedTemporaryFile("w", suffix=".ini", delete=False) as cfg:
-            cfg.write(config_text)
-            cfg_path = cfg.name
-        self.addCleanup(lambda: os.path.exists(cfg_path) and os.remove(cfg_path))
-
-        with patch.object(sys, "argv", ["parsedmarc", "-c", cfg_path]):
-            parsedmarc.cli._main()
-
-        self.assertEqual(
-            mock_graph_connection.call_args.kwargs.get("mailbox"),
-            "owner@example.com",
-        )
-        self.assertEqual(
-            mock_graph_connection.call_args.kwargs.get("username"),
-            "owner@example.com",
-        )
-
-    @patch("parsedmarc.cli.get_dmarc_reports_from_mailbox")
-    @patch("parsedmarc.cli.MSGraphConnection")
-    @patch("parsedmarc.cli.logger")
-    def testCliRequiresMsGraphPasswordForUsernamePasswordAuth(
-        self, mock_logger, mock_graph_connection, mock_get_mailbox_reports
-    ):
-        config_text = """[general]
-silent = true
-
-[msgraph]
-auth_method = UsernamePassword
-client_id = client-id
-client_secret = client-secret
-user = owner@example.com
-"""
-
-        with tempfile.NamedTemporaryFile("w", suffix=".ini", delete=False) as cfg:
-            cfg.write(config_text)
-            cfg_path = cfg.name
-        self.addCleanup(lambda: os.path.exists(cfg_path) and os.remove(cfg_path))
-
-        with patch.object(sys, "argv", ["parsedmarc", "-c", cfg_path]):
-            with self.assertRaises(SystemExit) as system_exit:
-                parsedmarc.cli._main()
-
-        self.assertEqual(system_exit.exception.code, -1)
-        mock_logger.critical.assert_called_once_with(
-            "password setting missing from the msgraph config section"
-        )
-        mock_graph_connection.assert_not_called()
-        mock_get_mailbox_reports.assert_not_called()
-
-
-class TestMSGraphCliValidation(unittest.TestCase):
-    @patch("parsedmarc.cli.get_dmarc_reports_from_mailbox")
-    @patch("parsedmarc.cli.MSGraphConnection")
-    def testCliPassesMsGraphClientSecretAuthSettings(
-        self, mock_graph_connection, mock_get_mailbox_reports
-    ):
-        mock_graph_connection.return_value = object()
-        mock_get_mailbox_reports.return_value = {
-            "aggregate_reports": [],
-            "failure_reports": [],
-            "smtp_tls_reports": [],
-        }
-
-        config_text = """[general]
-silent = true
-
-[msgraph]
-auth_method = ClientSecret
-client_id = client-id
-client_secret = client-secret
-tenant_id = tenant-id
-mailbox = shared@example.com
-"""
-
-        with tempfile.NamedTemporaryFile("w", suffix=".ini", delete=False) as cfg:
-            cfg.write(config_text)
-            cfg_path = cfg.name
-        self.addCleanup(lambda: os.path.exists(cfg_path) and os.remove(cfg_path))
-
-        with patch.object(sys, "argv", ["parsedmarc", "-c", cfg_path]):
-            parsedmarc.cli._main()
-
-        self.assertEqual(
-            mock_graph_connection.call_args.kwargs.get("auth_method"), "ClientSecret"
-        )
-        self.assertEqual(
-            mock_graph_connection.call_args.kwargs.get("client_secret"),
-            "client-secret",
-        )
-        self.assertEqual(
-            mock_graph_connection.call_args.kwargs.get("tenant_id"), "tenant-id"
-        )
-        self.assertEqual(
-            mock_graph_connection.call_args.kwargs.get("mailbox"),
-            "shared@example.com",
-        )
-
-    @patch("parsedmarc.cli.get_dmarc_reports_from_mailbox")
-    @patch("parsedmarc.cli.MSGraphConnection")
-    @patch("parsedmarc.cli.logger")
-    def testCliRequiresMsGraphClientSecretForClientSecretAuth(
-        self, mock_logger, mock_graph_connection, mock_get_mailbox_reports
-    ):
-        config_text = """[general]
-silent = true
-
-[msgraph]
-auth_method = ClientSecret
-client_id = client-id
-tenant_id = tenant-id
-mailbox = shared@example.com
-"""
-
-        with tempfile.NamedTemporaryFile("w", suffix=".ini", delete=False) as cfg:
-            cfg.write(config_text)
-            cfg_path = cfg.name
-        self.addCleanup(lambda: os.path.exists(cfg_path) and os.remove(cfg_path))
-
-        with patch.object(sys, "argv", ["parsedmarc", "-c", cfg_path]):
-            with self.assertRaises(SystemExit) as system_exit:
-                parsedmarc.cli._main()
-
-        self.assertEqual(system_exit.exception.code, -1)
-        mock_logger.critical.assert_called_once_with(
-            "client_secret setting missing from the msgraph config section"
-        )
-        mock_graph_connection.assert_not_called()
-        mock_get_mailbox_reports.assert_not_called()
-
-    @patch("parsedmarc.cli.get_dmarc_reports_from_mailbox")
-    @patch("parsedmarc.cli.MSGraphConnection")
-    @patch("parsedmarc.cli.logger")
-    def testCliRequiresMsGraphTenantIdForClientSecretAuth(
-        self, mock_logger, mock_graph_connection, mock_get_mailbox_reports
-    ):
-        config_text = """[general]
-silent = true
-
-[msgraph]
-auth_method = ClientSecret
-client_id = client-id
-client_secret = client-secret
-mailbox = shared@example.com
-"""
-
-        with tempfile.NamedTemporaryFile("w", suffix=".ini", delete=False) as cfg:
-            cfg.write(config_text)
-            cfg_path = cfg.name
-        self.addCleanup(lambda: os.path.exists(cfg_path) and os.remove(cfg_path))
-
-        with patch.object(sys, "argv", ["parsedmarc", "-c", cfg_path]):
-            with self.assertRaises(SystemExit) as system_exit:
-                parsedmarc.cli._main()
-
-        self.assertEqual(system_exit.exception.code, -1)
-        mock_logger.critical.assert_called_once_with(
-            "tenant_id setting missing from the msgraph config section"
-        )
-        mock_graph_connection.assert_not_called()
-        mock_get_mailbox_reports.assert_not_called()
-
-    @patch("parsedmarc.cli.get_dmarc_reports_from_mailbox")
-    @patch("parsedmarc.cli.MSGraphConnection")
-    @patch("parsedmarc.cli.logger")
-    def testCliRequiresMsGraphMailboxForClientSecretAuth(
-        self, mock_logger, mock_graph_connection, mock_get_mailbox_reports
-    ):
-        config_text = """[general]
-silent = true
-
-[msgraph]
-auth_method = ClientSecret
-client_id = client-id
-client_secret = client-secret
-tenant_id = tenant-id
-"""
-
-        with tempfile.NamedTemporaryFile("w", suffix=".ini", delete=False) as cfg:
-            cfg.write(config_text)
-            cfg_path = cfg.name
-        self.addCleanup(lambda: os.path.exists(cfg_path) and os.remove(cfg_path))
-
-        with patch.object(sys, "argv", ["parsedmarc", "-c", cfg_path]):
-            with self.assertRaises(SystemExit) as system_exit:
-                parsedmarc.cli._main()
-
-        self.assertEqual(system_exit.exception.code, -1)
-        mock_logger.critical.assert_called_once_with(
-            "mailbox setting missing from the msgraph config section"
-        )
-        mock_graph_connection.assert_not_called()
-        mock_get_mailbox_reports.assert_not_called()
-
-    @patch("parsedmarc.cli.get_dmarc_reports_from_mailbox")
-    @patch("parsedmarc.cli.MSGraphConnection")
-    def testCliAllowsMsGraphDeviceCodeWithoutUser(
-        self, mock_graph_connection, mock_get_mailbox_reports
-    ):
-        mock_graph_connection.return_value = object()
-        mock_get_mailbox_reports.return_value = {
-            "aggregate_reports": [],
-            "failure_reports": [],
-            "smtp_tls_reports": [],
-        }
-
-        config_text = """[general]
-silent = true
-
-[msgraph]
-auth_method = DeviceCode
-client_id = client-id
-tenant_id = tenant-id
-mailbox = shared@example.com
-"""
-
-        with tempfile.NamedTemporaryFile("w", suffix=".ini", delete=False) as cfg:
-            cfg.write(config_text)
-            cfg_path = cfg.name
-        self.addCleanup(lambda: os.path.exists(cfg_path) and os.remove(cfg_path))
-
-        with patch.object(sys, "argv", ["parsedmarc", "-c", cfg_path]):
-            parsedmarc.cli._main()
-
-        self.assertEqual(
-            mock_graph_connection.call_args.kwargs.get("auth_method"), "DeviceCode"
-        )
-        self.assertEqual(
-            mock_graph_connection.call_args.kwargs.get("mailbox"),
-            "shared@example.com",
-        )
-        self.assertIsNone(mock_graph_connection.call_args.kwargs.get("username"))
-
-    @patch("parsedmarc.cli.get_dmarc_reports_from_mailbox")
-    @patch("parsedmarc.cli.MSGraphConnection")
-    @patch("parsedmarc.cli.logger")
-    def testCliRequiresMsGraphTenantIdForDeviceCodeAuth(
-        self, mock_logger, mock_graph_connection, mock_get_mailbox_reports
-    ):
-        config_text = """[general]
-silent = true
-
-[msgraph]
-auth_method = DeviceCode
-client_id = client-id
-mailbox = shared@example.com
-"""
-
-        with tempfile.NamedTemporaryFile("w", suffix=".ini", delete=False) as cfg:
-            cfg.write(config_text)
-            cfg_path = cfg.name
-        self.addCleanup(lambda: os.path.exists(cfg_path) and os.remove(cfg_path))
-
-        with patch.object(sys, "argv", ["parsedmarc", "-c", cfg_path]):
-            with self.assertRaises(SystemExit) as system_exit:
-                parsedmarc.cli._main()
-
-        self.assertEqual(system_exit.exception.code, -1)
-        mock_logger.critical.assert_called_once_with(
-            "tenant_id setting missing from the msgraph config section"
-        )
-        mock_graph_connection.assert_not_called()
-        mock_get_mailbox_reports.assert_not_called()
-
-    @patch("parsedmarc.cli.get_dmarc_reports_from_mailbox")
-    @patch("parsedmarc.cli.MSGraphConnection")
-    @patch("parsedmarc.cli.logger")
-    def testCliRequiresMsGraphMailboxForDeviceCodeAuth(
-        self, mock_logger, mock_graph_connection, mock_get_mailbox_reports
-    ):
-        config_text = """[general]
-silent = true
-
-[msgraph]
-auth_method = DeviceCode
-client_id = client-id
-tenant_id = tenant-id
-"""
-
-        with tempfile.NamedTemporaryFile("w", suffix=".ini", delete=False) as cfg:
-            cfg.write(config_text)
-            cfg_path = cfg.name
-        self.addCleanup(lambda: os.path.exists(cfg_path) and os.remove(cfg_path))
-
-        with patch.object(sys, "argv", ["parsedmarc", "-c", cfg_path]):
-            with self.assertRaises(SystemExit) as system_exit:
-                parsedmarc.cli._main()
-
-        self.assertEqual(system_exit.exception.code, -1)
-        mock_logger.critical.assert_called_once_with(
-            "mailbox setting missing from the msgraph config section"
-        )
-        mock_graph_connection.assert_not_called()
-        mock_get_mailbox_reports.assert_not_called()
-
-    @patch("parsedmarc.cli.get_dmarc_reports_from_mailbox")
-    @patch("parsedmarc.cli.MSGraphConnection")
-    @patch("parsedmarc.cli.logger")
-    def testCliRequiresMsGraphTenantIdForCertificateAuth(
-        self, mock_logger, mock_graph_connection, mock_get_mailbox_reports
-    ):
-        config_text = """[general]
-silent = true
-
-[msgraph]
-auth_method = Certificate
-client_id = client-id
-mailbox = shared@example.com
-certificate_path = /tmp/msgraph-cert.pem
-"""
-
-        with tempfile.NamedTemporaryFile("w", suffix=".ini", delete=False) as cfg:
-            cfg.write(config_text)
-            cfg_path = cfg.name
-        self.addCleanup(lambda: os.path.exists(cfg_path) and os.remove(cfg_path))
-
-        with patch.object(sys, "argv", ["parsedmarc", "-c", cfg_path]):
-            with self.assertRaises(SystemExit) as system_exit:
-                parsedmarc.cli._main()
-
-        self.assertEqual(system_exit.exception.code, -1)
-        mock_logger.critical.assert_called_once_with(
-            "tenant_id setting missing from the msgraph config section"
-        )
-        mock_graph_connection.assert_not_called()
-        mock_get_mailbox_reports.assert_not_called()
-
-    @patch("parsedmarc.cli.get_dmarc_reports_from_mailbox")
-    @patch("parsedmarc.cli.MSGraphConnection")
-    @patch("parsedmarc.cli.logger")
-    def testCliRequiresMsGraphMailboxForCertificateAuth(
-        self, mock_logger, mock_graph_connection, mock_get_mailbox_reports
-    ):
-        config_text = """[general]
-silent = true
-
-[msgraph]
-auth_method = Certificate
-client_id = client-id
-tenant_id = tenant-id
-certificate_path = /tmp/msgraph-cert.pem
-"""
-
-        with tempfile.NamedTemporaryFile("w", suffix=".ini", delete=False) as cfg:
-            cfg.write(config_text)
-            cfg_path = cfg.name
-        self.addCleanup(lambda: os.path.exists(cfg_path) and os.remove(cfg_path))
-
-        with patch.object(sys, "argv", ["parsedmarc", "-c", cfg_path]):
-            with self.assertRaises(SystemExit) as system_exit:
-                parsedmarc.cli._main()
-
-        self.assertEqual(system_exit.exception.code, -1)
-        mock_logger.critical.assert_called_once_with(
-            "mailbox setting missing from the msgraph config section"
-        )
-        mock_graph_connection.assert_not_called()
-        mock_get_mailbox_reports.assert_not_called()
-
-
-class TestSighupReload(unittest.TestCase):
-    """Tests for SIGHUP-driven configuration reload in watch mode."""
-
-    def setUp(self):
-        from parsedmarc.log import logger as _logger
-
-        _logger.disabled = True
-        self._stdout_patch = patch("sys.stdout", new_callable=io.StringIO)
-        self._stderr_patch = patch("sys.stderr", new_callable=io.StringIO)
-        self._stdout_patch.start()
-        self._stderr_patch.start()
-
-    def tearDown(self):
-        from parsedmarc.log import logger as _logger
-
-        _logger.disabled = False
-        self._stderr_patch.stop()
-        self._stdout_patch.stop()
-
-    _BASE_CONFIG = """[general]
-silent = true
-
-[imap]
-host = imap.example.com
-user = user
-password = pass
-
-[mailbox]
-watch = true
-"""
-
-    @unittest.skipUnless(
-        hasattr(signal, "SIGHUP"),
-        "SIGHUP not available on this platform",
-    )
-    @patch("parsedmarc.cli._init_output_clients")
-    @patch("parsedmarc.cli._parse_config")
-    @patch("parsedmarc.cli._load_config")
-    @patch("parsedmarc.cli.get_dmarc_reports_from_mailbox")
-    @patch("parsedmarc.cli.watch_inbox")
-    @patch("parsedmarc.cli.IMAPConnection")
-    def testSighupTriggersReloadAndWatchRestarts(
-        self,
-        mock_imap,
-        mock_watch,
-        mock_get_reports,
-        mock_load_config,
-        mock_parse_config,
-        mock_init_clients,
-    ):
-        """SIGHUP causes watch to return, config is re-parsed, and watch restarts."""
-        import signal as signal_module
-
-        mock_imap.return_value = object()
-        mock_get_reports.return_value = {
-            "aggregate_reports": [],
-            "failure_reports": [],
-            "smtp_tls_reports": [],
-        }
-
-        mock_load_config.return_value = ConfigParser()
-
-        def parse_side_effect(config, opts):
-            opts.imap_host = "imap.example.com"
-            opts.imap_user = "user"
-            opts.imap_password = "pass"
-            opts.mailbox_watch = True
-            return None
-
-        mock_parse_config.side_effect = parse_side_effect
-        mock_init_clients.return_value = {}
-
-        call_count = [0]
-
-        def watch_side_effect(*args, **kwargs):
-            call_count[0] += 1
-            if call_count[0] == 1:
-                # Simulate SIGHUP arriving while watch is running
-                if hasattr(signal_module, "SIGHUP"):
-                    import os
-
-                    os.kill(os.getpid(), signal_module.SIGHUP)
-                return  # Normal return — reload loop will continue
-            else:
-                raise FileExistsError("stop-watch-loop")
-
-        mock_watch.side_effect = watch_side_effect
-
-        with tempfile.NamedTemporaryFile("w", suffix=".ini", delete=False) as cfg:
-            cfg.write(self._BASE_CONFIG)
-            cfg_path = cfg.name
-        self.addCleanup(lambda: os.path.exists(cfg_path) and os.remove(cfg_path))
-
-        with patch.object(sys, "argv", ["parsedmarc", "-c", cfg_path]):
-            with self.assertRaises(SystemExit) as cm:
-                parsedmarc.cli._main()
-
-        # Exited with code 1 (from FileExistsError handler)
-        self.assertEqual(cm.exception.code, 1)
-        # watch_inbox was called twice: initial run + after reload
-        self.assertEqual(mock_watch.call_count, 2)
-        # _parse_config called for initial load + reload
-        self.assertGreaterEqual(mock_parse_config.call_count, 2)
-
-    @unittest.skipUnless(
-        hasattr(signal, "SIGHUP"),
-        "SIGHUP not available on this platform",
-    )
-    @patch("parsedmarc.cli._init_output_clients")
-    @patch("parsedmarc.cli._parse_config")
-    @patch("parsedmarc.cli._load_config")
-    @patch("parsedmarc.cli.get_dmarc_reports_from_mailbox")
-    @patch("parsedmarc.cli.watch_inbox")
-    @patch("parsedmarc.cli.IMAPConnection")
-    def testInvalidConfigOnReloadKeepsPreviousState(
-        self,
-        mock_imap,
-        mock_watch,
-        mock_get_reports,
-        mock_load_config,
-        mock_parse_config,
-        mock_init_clients,
-    ):
-        """A failing reload leaves opts and clients unchanged."""
-        import signal as signal_module
-
-        mock_imap.return_value = object()
-        mock_get_reports.return_value = {
-            "aggregate_reports": [],
-            "failure_reports": [],
-            "smtp_tls_reports": [],
-        }
-
-        mock_load_config.return_value = ConfigParser()
-
-        # Initial parse sets required opts; reload parse raises
-        initial_map = {"prefix_": ["example.com"]}
-        call_count = [0]
-
-        def parse_side_effect(config, opts):
-            call_count[0] += 1
-            opts.imap_host = "imap.example.com"
-            opts.imap_user = "user"
-            opts.imap_password = "pass"
-            opts.mailbox_watch = True
-            if call_count[0] == 1:
-                return initial_map
-            raise RuntimeError("bad config")
-
-        mock_parse_config.side_effect = parse_side_effect
-
-        initial_clients = {"s3_client": MagicMock()}
-        mock_init_clients.return_value = initial_clients
-
-        watch_calls = [0]
-
-        def watch_side_effect(*args, **kwargs):
-            watch_calls[0] += 1
-            if watch_calls[0] == 1:
-                if hasattr(signal_module, "SIGHUP"):
-                    import os
-
-                    os.kill(os.getpid(), signal_module.SIGHUP)
-                return
-            else:
-                raise FileExistsError("stop")
-
-        mock_watch.side_effect = watch_side_effect
-
-        with tempfile.NamedTemporaryFile("w", suffix=".ini", delete=False) as cfg:
-            cfg.write(self._BASE_CONFIG)
-            cfg_path = cfg.name
-        self.addCleanup(lambda: os.path.exists(cfg_path) and os.remove(cfg_path))
-
-        with patch.object(sys, "argv", ["parsedmarc", "-c", cfg_path]):
-            with self.assertRaises(SystemExit) as cm:
-                parsedmarc.cli._main()
-
-        self.assertEqual(cm.exception.code, 1)
-        # watch was still called twice (reload loop continued after failed reload)
-        self.assertEqual(mock_watch.call_count, 2)
-        # The failed reload must not have closed the original clients
-        initial_clients["s3_client"].close.assert_not_called()
-
-    @unittest.skipUnless(
-        hasattr(signal, "SIGHUP"),
-        "SIGHUP not available on this platform",
-    )
-    @patch("parsedmarc.cli._init_output_clients")
-    @patch("parsedmarc.cli._parse_config")
-    @patch("parsedmarc.cli._load_config")
-    @patch("parsedmarc.cli.get_dmarc_reports_from_mailbox")
-    @patch("parsedmarc.cli.watch_inbox")
-    @patch("parsedmarc.cli.IMAPConnection")
-    def testReloadClosesOldClients(
-        self,
-        mock_imap,
-        mock_watch,
-        mock_get_reports,
-        mock_load_config,
-        mock_parse_config,
-        mock_init_clients,
-    ):
-        """Successful reload closes the old output clients before replacing them."""
-        import signal as signal_module
-
-        mock_imap.return_value = object()
-        mock_get_reports.return_value = {
-            "aggregate_reports": [],
-            "failure_reports": [],
-            "smtp_tls_reports": [],
-        }
-
-        mock_load_config.return_value = ConfigParser()
-
-        def parse_side_effect(config, opts):
-            opts.imap_host = "imap.example.com"
-            opts.imap_user = "user"
-            opts.imap_password = "pass"
-            opts.mailbox_watch = True
-            return None
-
-        mock_parse_config.side_effect = parse_side_effect
-
-        old_client = MagicMock()
-        new_client = MagicMock()
-        init_call = [0]
-
-        def init_side_effect(opts):
-            init_call[0] += 1
-            if init_call[0] == 1:
-                return {"kafka_client": old_client}
-            return {"kafka_client": new_client}
-
-        mock_init_clients.side_effect = init_side_effect
-
-        watch_calls = [0]
-
-        def watch_side_effect(*args, **kwargs):
-            watch_calls[0] += 1
-            if watch_calls[0] == 1:
-                if hasattr(signal_module, "SIGHUP"):
-                    import os
-
-                    os.kill(os.getpid(), signal_module.SIGHUP)
-                return
-            else:
-                raise FileExistsError("stop")
-
-        mock_watch.side_effect = watch_side_effect
-
-        with tempfile.NamedTemporaryFile("w", suffix=".ini", delete=False) as cfg:
-            cfg.write(self._BASE_CONFIG)
-            cfg_path = cfg.name
-        self.addCleanup(lambda: os.path.exists(cfg_path) and os.remove(cfg_path))
-
-        with patch.object(sys, "argv", ["parsedmarc", "-c", cfg_path]):
-            with self.assertRaises(SystemExit):
-                parsedmarc.cli._main()
-
-        # Old client must have been closed when reload succeeded
-        old_client.close.assert_called_once()
-
-    @unittest.skipUnless(
-        hasattr(signal, "SIGHUP"),
-        "SIGHUP not available on this platform",
-    )
-    @patch("parsedmarc.cli._init_output_clients")
-    @patch("parsedmarc.cli.get_dmarc_reports_from_mailbox")
-    @patch("parsedmarc.cli.watch_inbox")
-    @patch("parsedmarc.cli.IMAPConnection")
-    def testRemovedConfigSectionTakesEffectOnReload(
-        self,
-        mock_imap,
-        mock_watch,
-        mock_get_reports,
-        mock_init_clients,
-    ):
-        """Removing a config section on reload resets that option to its default."""
-        import signal as signal_module
-
-        mock_imap.return_value = object()
-        mock_get_reports.return_value = {
-            "aggregate_reports": [],
-            "failure_reports": [],
-            "smtp_tls_reports": [],
-        }
-        mock_init_clients.return_value = {}
-
-        # First config sets kafka_hosts (with required topics); second removes it.
-        config_v1 = (
-            self._BASE_CONFIG
-            + "\n[kafka]\nhosts = kafka.example.com:9092\n"
-            + "aggregate_topic = dmarc_agg\n"
-            + "forensic_topic = dmarc_forensic\n"
-            + "smtp_tls_topic = smtp_tls\n"
-        )
-        config_v2 = self._BASE_CONFIG  # no [kafka] section
-
-        with tempfile.NamedTemporaryFile("w", suffix=".ini", delete=False) as cfg:
-            cfg.write(config_v1)
-            cfg_path = cfg.name
-        self.addCleanup(lambda: os.path.exists(cfg_path) and os.remove(cfg_path))
-
-        watch_calls = [0]
-
-        def watch_side_effect(*args, **kwargs):
-            watch_calls[0] += 1
-            if watch_calls[0] == 1:
-                # Rewrite config to remove kafka before triggering reload
-                with open(cfg_path, "w") as f:
-                    f.write(config_v2)
-                if hasattr(signal_module, "SIGHUP"):
-                    import os
-
-                    os.kill(os.getpid(), signal_module.SIGHUP)
-                return
-            else:
-                raise FileExistsError("stop")
-
-        mock_watch.side_effect = watch_side_effect
-
-        # Capture opts used on each _init_output_clients call
-        init_opts_captures = []
-
-        def init_side_effect(opts):
-            from argparse import Namespace as NS
-
-            init_opts_captures.append(NS(**vars(opts)))
-            return {}
-
-        mock_init_clients.side_effect = init_side_effect
-
-        with patch.object(sys, "argv", ["parsedmarc", "-c", cfg_path]):
-            with self.assertRaises(SystemExit):
-                parsedmarc.cli._main()
-
-        # First init: kafka_hosts should be set from v1 config
-        self.assertIsNotNone(init_opts_captures[0].kafka_hosts)
-        # Second init (after reload with v2 config): kafka_hosts should be None
-        self.assertIsNone(init_opts_captures[1].kafka_hosts)
-
-    @unittest.skipUnless(
-        hasattr(signal, "SIGHUP"),
-        "SIGHUP not available on this platform",
-    )
-    @patch("parsedmarc.cli._init_output_clients")
-    @patch("parsedmarc.cli._parse_config")
-    @patch("parsedmarc.cli._load_config")
-    @patch("parsedmarc.cli.get_dmarc_reports_from_mailbox")
-    @patch("parsedmarc.cli.watch_inbox")
-    @patch("parsedmarc.cli.IMAPConnection")
-    def testReloadRefreshesReverseDnsMap(
-        self,
-        mock_imap,
-        mock_watch,
-        mock_get_reports,
-        mock_load_config,
-        mock_parse_config,
-        mock_init_clients,
-    ):
-        """SIGHUP reload repopulates the reverse DNS map so lookups still work."""
-        import signal as signal_module
-
-        from parsedmarc import REVERSE_DNS_MAP
-
-        mock_imap.return_value = object()
-        mock_get_reports.return_value = {
-            "aggregate_reports": [],
-            "failure_reports": [],
-            "smtp_tls_reports": [],
-        }
-
-        mock_load_config.return_value = ConfigParser()
-
-        def parse_side_effect(config, opts):
-            opts.imap_host = "imap.example.com"
-            opts.imap_user = "user"
-            opts.imap_password = "pass"
-            opts.mailbox_watch = True
-            return None
-
-        mock_parse_config.side_effect = parse_side_effect
-        mock_init_clients.return_value = {}
-
-        # Snapshot the map state after each watch_inbox call
-        map_snapshots = []
-
-        watch_calls = [0]
-
-        def watch_side_effect(*args, **kwargs):
-            watch_calls[0] += 1
-            if watch_calls[0] == 1:
-                if hasattr(signal_module, "SIGHUP"):
-                    import os
-
-                    os.kill(os.getpid(), signal_module.SIGHUP)
-                return
-            else:
-                # Capture the map state after reload, before we stop the loop
-                map_snapshots.append(dict(REVERSE_DNS_MAP))
-                raise FileExistsError("stop")
-
-        mock_watch.side_effect = watch_side_effect
-
-        with tempfile.NamedTemporaryFile("w", suffix=".ini", delete=False) as cfg:
-            cfg.write(self._BASE_CONFIG)
-            cfg_path = cfg.name
-        self.addCleanup(lambda: os.path.exists(cfg_path) and os.remove(cfg_path))
-
-        # Pre-populate the map so we can verify it gets refreshed
-        REVERSE_DNS_MAP.clear()
-        REVERSE_DNS_MAP["stale.example.com"] = {
-            "name": "Stale",
-            "type": "stale",
-        }
-        original_contents = dict(REVERSE_DNS_MAP)
-
-        with patch.object(sys, "argv", ["parsedmarc", "-c", cfg_path]):
-            with self.assertRaises(SystemExit):
-                parsedmarc.cli._main()
-
-        self.assertEqual(mock_watch.call_count, 2)
-        # The map should have been repopulated (not empty, not the stale data)
-        self.assertEqual(len(map_snapshots), 1)
-        refreshed = map_snapshots[0]
-        self.assertGreater(len(refreshed), 0, "Map should not be empty after reload")
-        self.assertNotEqual(
-            refreshed,
-            original_contents,
-            "Map should have been refreshed, not kept stale data",
-        )
-        self.assertNotIn(
-            "stale.example.com",
-            refreshed,
-            "Stale entry should have been cleared by reload",
-        )
-
-
-class TestIndexPrefixDomainMapTlsFiltering(unittest.TestCase):
-    """Tests that SMTP TLS reports for unmapped domains are filtered out
-    when index_prefix_domain_map is configured."""
-
-    @patch("parsedmarc.cli.get_dmarc_reports_from_mailbox")
-    @patch("parsedmarc.cli.IMAPConnection")
-    def testTlsReportsFilteredByDomainMap(
-        self,
-        mock_imap_connection,
-        mock_get_reports,
-    ):
-        """TLS reports for domains not in the map should be silently dropped."""
-        mock_imap_connection.return_value = object()
-        mock_get_reports.return_value = {
-            "aggregate_reports": [],
-            "failure_reports": [],
-            "smtp_tls_reports": [
-                {
-                    "organization_name": "Allowed Org",
-                    "begin_date": "2024-01-01T00:00:00Z",
-                    "end_date": "2024-01-01T23:59:59Z",
-                    "report_id": "allowed-1",
-                    "contact_info": "tls@allowed.example.com",
-                    "policies": [
-                        {
-                            "policy_domain": "allowed.example.com",
-                            "policy_type": "sts",
-                            "successful_session_count": 1,
-                            "failed_session_count": 0,
-                        }
-                    ],
-                },
-                {
-                    "organization_name": "Unmapped Org",
-                    "begin_date": "2024-01-01T00:00:00Z",
-                    "end_date": "2024-01-01T23:59:59Z",
-                    "report_id": "unmapped-1",
-                    "contact_info": "tls@unmapped.example.net",
-                    "policies": [
-                        {
-                            "policy_domain": "unmapped.example.net",
-                            "policy_type": "sts",
-                            "successful_session_count": 5,
-                            "failed_session_count": 0,
-                        }
-                    ],
-                },
-                {
-                    "organization_name": "Mixed Case Org",
-                    "begin_date": "2024-01-01T00:00:00Z",
-                    "end_date": "2024-01-01T23:59:59Z",
-                    "report_id": "mixed-case-1",
-                    "contact_info": "tls@mixedcase.example.com",
-                    "policies": [
-                        {
-                            "policy_domain": "MixedCase.Example.Com",
-                            "policy_type": "sts",
-                            "successful_session_count": 2,
-                            "failed_session_count": 0,
-                        }
-                    ],
-                },
-            ],
-        }
-
-        domain_map = {"tenant_a": ["example.com"]}
-        with NamedTemporaryFile("w", suffix=".yaml", delete=False) as map_file:
-            import yaml
-
-            yaml.dump(domain_map, map_file)
-            map_path = map_file.name
-        self.addCleanup(lambda: os.path.exists(map_path) and os.remove(map_path))
-
-        config = f"""[general]
-save_smtp_tls = true
-silent = false
-index_prefix_domain_map = {map_path}
-
-[imap]
-host = imap.example.com
-user = test-user
-password = test-password
-"""
-        with NamedTemporaryFile("w", suffix=".ini", delete=False) as config_file:
-            config_file.write(config)
-            config_path = config_file.name
-        self.addCleanup(lambda: os.path.exists(config_path) and os.remove(config_path))
-
-        captured = io.StringIO()
-        with patch.object(sys, "argv", ["parsedmarc", "-c", config_path]):
-            with patch("sys.stdout", captured):
-                parsedmarc.cli._main()
-
-        output = json.loads(captured.getvalue())
-        tls_reports = output["smtp_tls_reports"]
-        self.assertEqual(len(tls_reports), 2)
-        report_ids = {r["report_id"] for r in tls_reports}
-        self.assertIn("allowed-1", report_ids)
-        self.assertIn("mixed-case-1", report_ids)
-        self.assertNotIn("unmapped-1", report_ids)
-
-
-class TestConfigAliases(unittest.TestCase):
-    """Tests for config key aliases (env var friendly short names)."""
-
-    def test_maildir_create_alias(self):
-        """[maildir] create works as alias for maildir_create."""
-        from argparse import Namespace
-        from parsedmarc.cli import _load_config, _parse_config
-
-        env = {
-            "PARSEDMARC_MAILDIR_CREATE": "true",
-            "PARSEDMARC_MAILDIR_PATH": "/tmp/test",
-        }
-        with patch.dict(os.environ, env, clear=False):
-            config = _load_config(None)
-        opts = Namespace()
-        _parse_config(config, opts)
-        self.assertTrue(opts.maildir_create)
-
-    def test_maildir_path_alias(self):
-        """[maildir] path works as alias for maildir_path."""
-        from argparse import Namespace
-        from parsedmarc.cli import _load_config, _parse_config
-
-        env = {"PARSEDMARC_MAILDIR_PATH": "/var/mail/dmarc"}
-        with patch.dict(os.environ, env, clear=False):
-            config = _load_config(None)
-        opts = Namespace()
-        _parse_config(config, opts)
-        self.assertEqual(opts.maildir_path, "/var/mail/dmarc")
-
-    def test_msgraph_url_alias(self):
-        """[msgraph] url works as alias for graph_url."""
-        from parsedmarc.cli import _load_config, _parse_config
-        from argparse import Namespace
-
-        env = {
-            "PARSEDMARC_MSGRAPH_AUTH_METHOD": "ClientSecret",
-            "PARSEDMARC_MSGRAPH_CLIENT_ID": "test-id",
-            "PARSEDMARC_MSGRAPH_CLIENT_SECRET": "test-secret",
-            "PARSEDMARC_MSGRAPH_TENANT_ID": "test-tenant",
-            "PARSEDMARC_MSGRAPH_MAILBOX": "test@example.com",
-            "PARSEDMARC_MSGRAPH_URL": "https://custom.graph.example.com",
-        }
-        with patch.dict(os.environ, env, clear=False):
-            config = _load_config(None)
-        opts = Namespace()
-        _parse_config(config, opts)
-        self.assertEqual(opts.graph_url, "https://custom.graph.example.com")
-
-    def test_original_keys_still_work(self):
-        """Original INI key names (maildir_create, maildir_path) still work."""
-        from argparse import Namespace
-        from parsedmarc.cli import _parse_config
-
-        config = ConfigParser(interpolation=None)
-        config.add_section("maildir")
-        config.set("maildir", "maildir_path", "/original/path")
-        config.set("maildir", "maildir_create", "true")
-
-        opts = Namespace()
-        _parse_config(config, opts)
-        self.assertEqual(opts.maildir_path, "/original/path")
-        self.assertTrue(opts.maildir_create)
-
-    def test_ipinfo_url_option(self):
-        """[general] ipinfo_url lands on opts.ipinfo_url."""
-        from argparse import Namespace
-        from parsedmarc.cli import _parse_config
-
-        config = ConfigParser(interpolation=None)
-        config.add_section("general")
-        config.set("general", "ipinfo_url", "https://mirror.example/mmdb")
-
-        opts = Namespace()
-        _parse_config(config, opts)
-        self.assertEqual(opts.ipinfo_url, "https://mirror.example/mmdb")
-
-    def test_ip_db_url_deprecated_alias(self):
-        """[general] ip_db_url is accepted as an alias for ipinfo_url but
-        emits a deprecation warning."""
-        from argparse import Namespace
-        from parsedmarc.cli import _parse_config
-
-        config = ConfigParser(interpolation=None)
-        config.add_section("general")
-        config.set("general", "ip_db_url", "https://old.example/mmdb")
-
-        opts = Namespace()
-        with self.assertLogs("parsedmarc.log", level="WARNING") as cm:
-            _parse_config(config, opts)
-        self.assertEqual(opts.ipinfo_url, "https://old.example/mmdb")
-        self.assertTrue(
-            any("ip_db_url" in line and "deprecated" in line for line in cm.output),
-            f"expected deprecation warning, got: {cm.output}",
-        )
-
-
-class TestExpandPath(unittest.TestCase):
-    """Tests for _expand_path config path expansion."""
-
-    def test_expand_tilde(self):
-        from parsedmarc.cli import _expand_path
-
-        result = _expand_path("~/some/path")
-        self.assertFalse(result.startswith("~"))
-        self.assertTrue(result.endswith("/some/path"))
-
-    def test_expand_env_var(self):
-        from parsedmarc.cli import _expand_path
-
-        with patch.dict(os.environ, {"PARSEDMARC_TEST_DIR": "/opt/data"}):
-            result = _expand_path("$PARSEDMARC_TEST_DIR/tokens/.token")
-        self.assertEqual(result, "/opt/data/tokens/.token")
-
-    def test_expand_both(self):
-        from parsedmarc.cli import _expand_path
-
-        with patch.dict(os.environ, {"MY_APP": "parsedmarc"}):
-            result = _expand_path("~/$MY_APP/config")
-        self.assertNotIn("~", result)
-        self.assertIn("parsedmarc/config", result)
-
-    def test_no_expansion_needed(self):
-        from parsedmarc.cli import _expand_path
-
-        self.assertEqual(_expand_path("/absolute/path"), "/absolute/path")
-        self.assertEqual(_expand_path("relative/path"), "relative/path")
-
-
-class TestEnvVarConfig(unittest.TestCase):
-    """Tests for environment variable configuration support."""
-
-    def test_resolve_section_key_simple(self):
-        """Simple section names resolve correctly."""
-        from parsedmarc.cli import _resolve_section_key
-
-        self.assertEqual(_resolve_section_key("IMAP_PASSWORD"), ("imap", "password"))
-        self.assertEqual(_resolve_section_key("GENERAL_DEBUG"), ("general", "debug"))
-        self.assertEqual(_resolve_section_key("S3_BUCKET"), ("s3", "bucket"))
-        self.assertEqual(_resolve_section_key("GELF_HOST"), ("gelf", "host"))
-
-    def test_resolve_section_key_underscore_sections(self):
-        """Multi-word section names (splunk_hec, gmail_api, etc.) resolve correctly."""
-        from parsedmarc.cli import _resolve_section_key
-
-        self.assertEqual(
-            _resolve_section_key("SPLUNK_HEC_TOKEN"), ("splunk_hec", "token")
-        )
-        self.assertEqual(
-            _resolve_section_key("GMAIL_API_CREDENTIALS_FILE"),
-            ("gmail_api", "credentials_file"),
-        )
-        self.assertEqual(
-            _resolve_section_key("LOG_ANALYTICS_CLIENT_ID"),
-            ("log_analytics", "client_id"),
-        )
-
-    def test_resolve_section_key_unknown(self):
-        """Unknown prefixes return (None, None)."""
-        from parsedmarc.cli import _resolve_section_key
-
-        self.assertEqual(_resolve_section_key("UNKNOWN_FOO"), (None, None))
-        # Just a section name with no key should not match
-        self.assertEqual(_resolve_section_key("IMAP"), (None, None))
-
-    def test_apply_env_overrides_injects_values(self):
-        """Env vars are injected into an existing ConfigParser."""
-        from configparser import ConfigParser
-        from parsedmarc.cli import _apply_env_overrides
-
-        config = ConfigParser()
-        config.add_section("imap")
-        config.set("imap", "host", "original.example.com")
-
-        env = {
-            "PARSEDMARC_IMAP_HOST": "new.example.com",
-            "PARSEDMARC_IMAP_PASSWORD": "secret123",
-        }
-        with patch.dict(os.environ, env, clear=False):
-            _apply_env_overrides(config)
-
-        self.assertEqual(config.get("imap", "host"), "new.example.com")
-        self.assertEqual(config.get("imap", "password"), "secret123")
-
-    def test_apply_env_overrides_creates_sections(self):
-        """Env vars create new sections when they don't exist."""
-        from configparser import ConfigParser
-        from parsedmarc.cli import _apply_env_overrides
-
-        config = ConfigParser()
-
-        env = {"PARSEDMARC_ELASTICSEARCH_HOSTS": "http://localhost:9200"}
-        with patch.dict(os.environ, env, clear=False):
-            _apply_env_overrides(config)
-
-        self.assertTrue(config.has_section("elasticsearch"))
-        self.assertEqual(config.get("elasticsearch", "hosts"), "http://localhost:9200")
-
-    def test_apply_env_overrides_ignores_config_file_var(self):
-        """PARSEDMARC_CONFIG_FILE is not injected as a config key."""
-        from configparser import ConfigParser
-        from parsedmarc.cli import _apply_env_overrides
-
-        config = ConfigParser()
-
-        env = {"PARSEDMARC_CONFIG_FILE": "/some/path.ini"}
-        with patch.dict(os.environ, env, clear=False):
-            _apply_env_overrides(config)
-
-        self.assertEqual(config.sections(), [])
-
-    def test_load_config_with_file_and_env_override(self):
-        """Env vars override values from an INI file."""
-        from parsedmarc.cli import _load_config
-
-        with NamedTemporaryFile(mode="w", suffix=".ini", delete=False) as f:
-            f.write(
-                "[imap]\nhost = file.example.com\nuser = alice\npassword = fromfile\n"
-            )
-            f.flush()
-            config_path = f.name
-
-        try:
-            env = {"PARSEDMARC_IMAP_PASSWORD": "fromenv"}
-            with patch.dict(os.environ, env, clear=False):
-                config = _load_config(config_path)
-
-            self.assertEqual(config.get("imap", "host"), "file.example.com")
-            self.assertEqual(config.get("imap", "user"), "alice")
-            self.assertEqual(config.get("imap", "password"), "fromenv")
-        finally:
-            os.unlink(config_path)
-
-    def test_load_config_env_only(self):
-        """Config can be loaded purely from env vars with no file."""
-        from parsedmarc.cli import _load_config
-
-        env = {
-            "PARSEDMARC_GENERAL_DEBUG": "true",
-            "PARSEDMARC_ELASTICSEARCH_HOSTS": "http://localhost:9200",
-        }
-        with patch.dict(os.environ, env, clear=False):
-            config = _load_config(None)
-
-        self.assertEqual(config.get("general", "debug"), "true")
-        self.assertEqual(config.get("elasticsearch", "hosts"), "http://localhost:9200")
-
-    def test_parse_config_from_env(self):
-        """Full round-trip: env vars -> ConfigParser -> opts."""
-        from argparse import Namespace
-        from parsedmarc.cli import _load_config, _parse_config
-
-        env = {
-            "PARSEDMARC_GENERAL_DEBUG": "true",
-            "PARSEDMARC_GENERAL_SAVE_AGGREGATE": "true",
-            "PARSEDMARC_GENERAL_OFFLINE": "true",
-        }
-        with patch.dict(os.environ, env, clear=False):
-            config = _load_config(None)
-
-        opts = Namespace()
-        _parse_config(config, opts)
-
-        self.assertTrue(opts.debug)
-        self.assertTrue(opts.save_aggregate)
-        self.assertTrue(opts.offline)
-
-    def test_config_file_env_var(self):
-        """PARSEDMARC_CONFIG_FILE env var specifies the config file path."""
-        from argparse import Namespace
-        from parsedmarc.cli import _load_config, _parse_config
-
-        with NamedTemporaryFile(mode="w", suffix=".ini", delete=False) as f:
-            f.write("[general]\ndebug = true\noffline = true\n")
-            f.flush()
-            config_path = f.name
-
-        try:
-            env = {"PARSEDMARC_CONFIG_FILE": config_path}
-            with patch.dict(os.environ, env, clear=False):
-                config = _load_config(os.environ.get("PARSEDMARC_CONFIG_FILE"))
-
-            opts = Namespace()
-            _parse_config(config, opts)
-            self.assertTrue(opts.debug)
-            self.assertTrue(opts.offline)
-        finally:
-            os.unlink(config_path)
-
-    def test_boolean_values_from_env(self):
-        """Various boolean string representations work through ConfigParser."""
-        from configparser import ConfigParser
-        from parsedmarc.cli import _apply_env_overrides
-
-        for true_val in ("true", "yes", "1", "on", "True", "YES"):
-            config = ConfigParser()
-            env = {"PARSEDMARC_GENERAL_DEBUG": true_val}
-            with patch.dict(os.environ, env, clear=False):
-                _apply_env_overrides(config)
-            self.assertTrue(
-                config.getboolean("general", "debug"),
-                f"Expected truthy for {true_val!r}",
-            )
-
-        for false_val in ("false", "no", "0", "off", "False", "NO"):
-            config = ConfigParser()
-            env = {"PARSEDMARC_GENERAL_DEBUG": false_val}
-            with patch.dict(os.environ, env, clear=False):
-                _apply_env_overrides(config)
-            self.assertFalse(
-                config.getboolean("general", "debug"),
-                f"Expected falsy for {false_val!r}",
-            )
-
-    # ============================================================    # New tests for _bucket_interval_by_day
-    # ============================================================
-    def testBucketIntervalBeginAfterEnd(self):
-        """begin > end should raise ValueError"""
-        begin = datetime(2024, 1, 2, tzinfo=timezone.utc)
-        end = datetime(2024, 1, 1, tzinfo=timezone.utc)
-        with self.assertRaises(ValueError):
-            parsedmarc._bucket_interval_by_day(begin, end, 100)
-
-    def testBucketIntervalNaiveDatetime(self):
-        """Non-timezone-aware datetimes should raise ValueError"""
-        begin = datetime(2024, 1, 1)
-        end = datetime(2024, 1, 2)
-        with self.assertRaises(ValueError):
-            parsedmarc._bucket_interval_by_day(begin, end, 100)
-
-    def testBucketIntervalDifferentTzinfo(self):
-        """Different tzinfo objects should raise ValueError"""
-        tz1 = timezone.utc
-        tz2 = timezone(timedelta(hours=5))
-        begin = datetime(2024, 1, 1, tzinfo=tz1)
-        end = datetime(2024, 1, 2, tzinfo=tz2)
-        with self.assertRaises(ValueError):
-            parsedmarc._bucket_interval_by_day(begin, end, 100)
-
-    def testBucketIntervalNegativeCount(self):
-        """Negative total_count should raise ValueError"""
-        begin = datetime(2024, 1, 1, tzinfo=timezone.utc)
-        end = datetime(2024, 1, 2, tzinfo=timezone.utc)
-        with self.assertRaises(ValueError):
-            parsedmarc._bucket_interval_by_day(begin, end, -1)
-
-    def testBucketIntervalZeroCount(self):
-        """Zero total_count should return empty list"""
-        begin = datetime(2024, 1, 1, tzinfo=timezone.utc)
-        end = datetime(2024, 1, 2, tzinfo=timezone.utc)
-        result = parsedmarc._bucket_interval_by_day(begin, end, 0)
-        self.assertEqual(result, [])
-
-    def testBucketIntervalSameBeginEnd(self):
-        """Same begin and end (zero interval) should return empty list"""
-        dt = datetime(2024, 1, 1, 12, 0, 0, tzinfo=timezone.utc)
-        result = parsedmarc._bucket_interval_by_day(dt, dt, 100)
-        self.assertEqual(result, [])
-
-    def testBucketIntervalSingleDay(self):
-        """Single day interval should return one bucket with total count"""
-        begin = datetime(2024, 1, 1, 0, 0, 0, tzinfo=timezone.utc)
-        end = datetime(2024, 1, 1, 23, 59, 59, tzinfo=timezone.utc)
-        result = parsedmarc._bucket_interval_by_day(begin, end, 100)
-        self.assertEqual(len(result), 1)
-        self.assertEqual(result[0]["count"], 100)
-        self.assertEqual(result[0]["begin"], begin)
-
-    def testBucketIntervalMultiDay(self):
-        """Multi-day interval should distribute counts proportionally"""
-        begin = datetime(2024, 1, 1, 0, 0, 0, tzinfo=timezone.utc)
-        end = datetime(2024, 1, 3, 0, 0, 0, tzinfo=timezone.utc)
-        result = parsedmarc._bucket_interval_by_day(begin, end, 100)
-        self.assertEqual(len(result), 2)
-        total = sum(b["count"] for b in result)
-        self.assertEqual(total, 100)
-        # Equal days => equal distribution
-        self.assertEqual(result[0]["count"], 50)
-        self.assertEqual(result[1]["count"], 50)
-
-    def testBucketIntervalRemainderDistribution(self):
-        """Odd count across equal days distributes remainder correctly"""
-        begin = datetime(2024, 1, 1, 0, 0, 0, tzinfo=timezone.utc)
-        end = datetime(2024, 1, 4, 0, 0, 0, tzinfo=timezone.utc)
-        result = parsedmarc._bucket_interval_by_day(begin, end, 10)
-        total = sum(b["count"] for b in result)
-        self.assertEqual(total, 10)
-        self.assertEqual(len(result), 3)
-
-    def testBucketIntervalPartialDays(self):
-        """Partial days: 12h on day1, 24h on day2 => 1/3 vs 2/3 split"""
-        begin = datetime(2024, 1, 1, 12, 0, 0, tzinfo=timezone.utc)
-        end = datetime(2024, 1, 3, 0, 0, 0, tzinfo=timezone.utc)
-        result = parsedmarc._bucket_interval_by_day(begin, end, 90)
-        total = sum(b["count"] for b in result)
-        self.assertEqual(total, 90)
-        # day1: 12h, day2: 24h => 1/3 vs 2/3
-        self.assertEqual(result[0]["count"], 30)
-        self.assertEqual(result[1]["count"], 60)
-
-    # ============================================================    # Tests for _append_parsed_record
-    # ============================================================
-    def testAppendParsedRecordNoNormalize(self):
-        """No normalization: record appended as-is with interval fields"""
-        records = []
-        rec = {"count": 10, "source": {"ip_address": "1.2.3.4"}}
-        begin = datetime(2024, 1, 1, 0, 0, 0, tzinfo=timezone.utc)
-        end = datetime(2024, 1, 2, 0, 0, 0, tzinfo=timezone.utc)
-        parsedmarc._append_parsed_record(rec, records, begin, end, False)
-        self.assertEqual(len(records), 1)
-        self.assertFalse(records[0]["normalized_timespan"])  # type: ignore[typeddict-item]
-        self.assertEqual(records[0]["interval_begin"], "2024-01-01 00:00:00")
-        self.assertEqual(records[0]["interval_end"], "2024-01-02 00:00:00")
-
-    def testAppendParsedRecordNormalize(self):
-        """Normalization: record split into daily buckets"""
-        records = []
-        rec = {"count": 100, "source": {"ip_address": "1.2.3.4"}}
-        begin = datetime(2024, 1, 1, 0, 0, 0, tzinfo=timezone.utc)
-        end = datetime(2024, 1, 3, 0, 0, 0, tzinfo=timezone.utc)
-        parsedmarc._append_parsed_record(rec, records, begin, end, True)
-        self.assertEqual(len(records), 2)
-        total = sum(r["count"] for r in records)
-        self.assertEqual(total, 100)
-        for r in records:
-            self.assertTrue(r["normalized_timespan"])  # type: ignore[typeddict-item]
-
-    def testAppendParsedRecordNormalizeZeroCount(self):
-        """Normalization with zero count: nothing appended"""
-        records = []
-        rec = {"count": 0, "source": {"ip_address": "1.2.3.4"}}
-        begin = datetime(2024, 1, 1, 0, 0, 0, tzinfo=timezone.utc)
-        end = datetime(2024, 1, 3, 0, 0, 0, tzinfo=timezone.utc)
-        parsedmarc._append_parsed_record(rec, records, begin, end, True)
-        self.assertEqual(len(records), 0)
-
-    # ============================================================    # Tests for _parse_report_record
-    # ============================================================
-    def testParseReportRecordNoneSourceIP(self):
-        """Record with None source_ip should raise ValueError"""
-        record = {
-            "row": {
-                "source_ip": None,
-                "count": "1",
-                "policy_evaluated": {
-                    "disposition": "none",
-                    "dkim": "pass",
-                    "spf": "pass",
-                },
-            },
-            "identifiers": {"header_from": "example.com"},
-            "auth_results": {"dkim": [], "spf": []},
-        }
-        with self.assertRaises(ValueError):
-            parsedmarc._parse_report_record(record, offline=True)
-
-    def testParseReportRecordMissingDkimSpf(self):
-        """Record with missing dkim/spf auth results defaults correctly"""
-        record = {
-            "row": {
-                "source_ip": "192.0.2.1",
-                "count": "5",
-                "policy_evaluated": {
-                    "disposition": "none",
-                    "dkim": "pass",
-                    "spf": "fail",
-                },
-            },
-            "identifiers": {"header_from": "example.com"},
-            "auth_results": {},
-        }
-        result = parsedmarc._parse_report_record(record, offline=True)
-        self.assertEqual(result["auth_results"]["dkim"], [])
-        self.assertEqual(result["auth_results"]["spf"], [])
-
-    def testParseReportRecordReasonHandling(self):
-        """Reasons in policy_evaluated get normalized with comment default"""
-        record = {
-            "row": {
-                "source_ip": "192.0.2.1",
-                "count": "1",
-                "policy_evaluated": {
-                    "disposition": "none",
-                    "dkim": "pass",
-                    "spf": "pass",
-                    "reason": {"type": "forwarded"},
-                },
-            },
-            "identifiers": {"header_from": "example.com"},
-            "auth_results": {"dkim": [], "spf": []},
-        }
-        result = parsedmarc._parse_report_record(record, offline=True)
-        reasons = result["policy_evaluated"]["policy_override_reasons"]
-        self.assertEqual(len(reasons), 1)
-        self.assertEqual(reasons[0]["type"], "forwarded")
-        self.assertIsNone(reasons[0]["comment"])
-
-    def testParseReportRecordReasonList(self):
-        """Multiple reasons as a list are preserved"""
-        record = {
-            "row": {
-                "source_ip": "192.0.2.1",
-                "count": "1",
-                "policy_evaluated": {
-                    "disposition": "none",
-                    "dkim": "pass",
-                    "spf": "pass",
-                    "reason": [
-                        {"type": "forwarded", "comment": "relay"},
-                        {"type": "local_policy"},
-                    ],
-                },
-            },
-            "identifiers": {"header_from": "example.com"},
-            "auth_results": {"dkim": [], "spf": []},
-        }
-        result = parsedmarc._parse_report_record(record, offline=True)
-        reasons = result["policy_evaluated"]["policy_override_reasons"]
-        self.assertEqual(len(reasons), 2)
-        self.assertEqual(reasons[0]["comment"], "relay")
-        self.assertIsNone(reasons[1]["comment"])
-
-    def testParseReportRecordIdentities(self):
-        """'identities' key is mapped to 'identifiers'"""
-        record = {
-            "row": {
-                "source_ip": "192.0.2.1",
-                "count": "1",
-                "policy_evaluated": {
-                    "disposition": "none",
-                    "dkim": "pass",
-                    "spf": "pass",
-                },
-            },
-            "identities": {
-                "header_from": "Example.COM",
-                "envelope_from": "example.com",
-            },
-            "auth_results": {"dkim": [], "spf": []},
-        }
-        result = parsedmarc._parse_report_record(record, offline=True)
-        self.assertIn("identifiers", result)
-        self.assertEqual(result["identifiers"]["header_from"], "example.com")
-
-    def testParseReportRecordDkimDefaults(self):
-        """DKIM result defaults: selector='none', result='none' when missing"""
-        record = {
-            "row": {
-                "source_ip": "192.0.2.1",
-                "count": "1",
-                "policy_evaluated": {
-                    "disposition": "none",
-                    "dkim": "fail",
-                    "spf": "fail",
-                },
-            },
-            "identifiers": {"header_from": "example.com"},
-            "auth_results": {
-                "dkim": {"domain": "example.com"},
-                "spf": [],
-            },
-        }
-        result = parsedmarc._parse_report_record(record, offline=True)
-        dkim = result["auth_results"]["dkim"][0]
-        self.assertEqual(dkim["selector"], "none")
-        self.assertEqual(dkim["result"], "none")
-        self.assertIsNone(dkim["human_result"])
-
-    def testParseReportRecordSpfDefaults(self):
-        """SPF result defaults: scope='mfrom', result='none' when missing"""
-        record = {
-            "row": {
-                "source_ip": "192.0.2.1",
-                "count": "1",
-                "policy_evaluated": {
-                    "disposition": "none",
-                    "dkim": "fail",
-                    "spf": "fail",
-                },
-            },
-            "identifiers": {"header_from": "example.com"},
-            "auth_results": {
-                "dkim": [],
-                "spf": {"domain": "example.com"},
-            },
-        }
-        result = parsedmarc._parse_report_record(record, offline=True)
-        spf = result["auth_results"]["spf"][0]
-        self.assertEqual(spf["scope"], "mfrom")
-        self.assertEqual(spf["result"], "none")
-        self.assertIsNone(spf["human_result"])
-
-    def testParseReportRecordHumanResult(self):
-        """human_result field is included when present"""
-        record = {
-            "row": {
-                "source_ip": "192.0.2.1",
-                "count": "1",
-                "policy_evaluated": {
-                    "disposition": "none",
-                    "dkim": "pass",
-                    "spf": "pass",
-                },
-            },
-            "identifiers": {"header_from": "example.com"},
-            "auth_results": {
-                "dkim": [
-                    {
-                        "domain": "example.com",
-                        "selector": "s1",
-                        "result": "pass",
-                        "human_result": "good key",
-                    }
-                ],
-                "spf": [
-                    {
-                        "domain": "example.com",
-                        "scope": "mfrom",
-                        "result": "pass",
-                        "human_result": "sender valid",
-                    }
-                ],
-            },
-        }
-        result = parsedmarc._parse_report_record(record, offline=True)
-        self.assertEqual(result["auth_results"]["dkim"][0]["human_result"], "good key")
-        self.assertEqual(
-            result["auth_results"]["spf"][0]["human_result"], "sender valid"
-        )
-
-    def testParseReportRecordEnvelopeFromFallback(self):
-        """envelope_from falls back to last SPF domain when missing"""
-        record = {
-            "row": {
-                "source_ip": "192.0.2.1",
-                "count": "1",
-                "policy_evaluated": {
-                    "disposition": "none",
-                    "dkim": "pass",
-                    "spf": "pass",
-                },
-            },
-            "identifiers": {"header_from": "example.com"},
-            "auth_results": {
-                "dkim": [],
-                "spf": [
-                    {"domain": "Bounce.Example.COM", "scope": "mfrom", "result": "pass"}
-                ],
-            },
-        }
-        result = parsedmarc._parse_report_record(record, offline=True)
-        self.assertEqual(result["identifiers"]["envelope_from"], "bounce.example.com")
-
-    def testParseReportRecordEnvelopeFromNullFallback(self):
-        """envelope_from None value falls back to SPF domain"""
-        record = {
-            "row": {
-                "source_ip": "192.0.2.1",
-                "count": "1",
-                "policy_evaluated": {
-                    "disposition": "none",
-                    "dkim": "pass",
-                    "spf": "pass",
-                },
-            },
-            "identifiers": {
-                "header_from": "example.com",
-                "envelope_from": None,
-            },
-            "auth_results": {
-                "dkim": [],
-                "spf": [
-                    {"domain": "SPF.Example.COM", "scope": "mfrom", "result": "pass"}
-                ],
-            },
-        }
-        result = parsedmarc._parse_report_record(record, offline=True)
-        self.assertEqual(result["identifiers"]["envelope_from"], "spf.example.com")
-
-    def testParseReportRecordEnvelopeTo(self):
-        """envelope_to is preserved and moved correctly"""
-        record = {
-            "row": {
-                "source_ip": "192.0.2.1",
-                "count": "1",
-                "policy_evaluated": {
-                    "disposition": "none",
-                    "dkim": "pass",
-                    "spf": "pass",
-                },
-            },
-            "identifiers": {
-                "header_from": "example.com",
-                "envelope_from": "bounce@example.com",
-                "envelope_to": "recipient@example.com",
-            },
-            "auth_results": {"dkim": [], "spf": []},
-        }
-        result = parsedmarc._parse_report_record(record, offline=True)
-        self.assertEqual(result["identifiers"]["envelope_to"], "recipient@example.com")
-
-    def testParseReportRecordAlignment(self):
-        """Alignment fields computed correctly from policy_evaluated"""
-        record = {
-            "row": {
-                "source_ip": "192.0.2.1",
-                "count": "1",
-                "policy_evaluated": {
-                    "disposition": "none",
-                    "dkim": "pass",
-                    "spf": "fail",
-                },
-            },
-            "identifiers": {"header_from": "example.com"},
-            "auth_results": {"dkim": [], "spf": []},
-        }
-        result = parsedmarc._parse_report_record(record, offline=True)
-        self.assertTrue(result["alignment"]["dkim"])
-        self.assertFalse(result["alignment"]["spf"])
-        self.assertTrue(result["alignment"]["dmarc"])
-
-    # ============================================================    # Tests for _parse_smtp_tls_failure_details
-    # ============================================================
-    def testParseSmtpTlsFailureDetailsMinimal(self):
-        """Minimal failure details with just required fields"""
-        details = {
-            "result-type": "certificate-expired",
-            "failed-session-count": 5,
-        }
-        result = parsedmarc._parse_smtp_tls_failure_details(details)
-        self.assertEqual(result["result_type"], "certificate-expired")
-        self.assertEqual(result["failed_session_count"], 5)
-        self.assertNotIn("sending_mta_ip", result)
-
-    def testParseSmtpTlsFailureDetailsAllOptional(self):
-        """All optional fields included"""
-        details = {
-            "result-type": "starttls-not-supported",
-            "failed-session-count": 3,
-            "sending-mta-ip": "10.0.0.1",
-            "receiving-ip": "10.0.0.2",
-            "receiving-mx-hostname": "mx.example.com",
-            "receiving-mx-helo": "mx.example.com",
-            "additional-info-uri": "https://example.com/info",
-            "failure-reason-code": "TLS_ERROR",
-        }
-        result = parsedmarc._parse_smtp_tls_failure_details(details)
-        self.assertEqual(result["sending_mta_ip"], "10.0.0.1")
-        self.assertEqual(result["receiving_ip"], "10.0.0.2")
-        self.assertEqual(result["receiving_mx_hostname"], "mx.example.com")
-        self.assertEqual(result["receiving_mx_helo"], "mx.example.com")
-        self.assertEqual(result["additional_info_uri"], "https://example.com/info")
-        self.assertEqual(result["failure_reason_code"], "TLS_ERROR")
-
-    def testParseSmtpTlsFailureDetailsMissingRequired(self):
-        """Missing required field raises InvalidSMTPTLSReport"""
-        with self.assertRaises(parsedmarc.InvalidSMTPTLSReport):
-            parsedmarc._parse_smtp_tls_failure_details({"result-type": "err"})
-
-    # ============================================================    # Tests for _parse_smtp_tls_report_policy
-    # ============================================================
-    def testParseSmtpTlsReportPolicyValid(self):
-        """Valid STS policy parses correctly"""
-        policy = {
-            "policy": {
-                "policy-type": "sts",
-                "policy-domain": "example.com",
-                "policy-string": ["version: STSv1", "mode: enforce"],
-                "mx-host-pattern": ["*.example.com"],
-            },
-            "summary": {
-                "total-successful-session-count": 100,
-                "total-failure-session-count": 2,
-            },
-        }
-        result = parsedmarc._parse_smtp_tls_report_policy(policy)
-        self.assertEqual(result["policy_type"], "sts")
-        self.assertEqual(result["policy_domain"], "example.com")
-        self.assertEqual(result["policy_strings"], ["version: STSv1", "mode: enforce"])
-        self.assertEqual(result["mx_host_patterns"], ["*.example.com"])
-        self.assertEqual(result["successful_session_count"], 100)
-        self.assertEqual(result["failed_session_count"], 2)
-
-    def testParseSmtpTlsReportPolicyInvalidType(self):
-        """Invalid policy type raises InvalidSMTPTLSReport"""
-        policy = {
-            "policy": {
-                "policy-type": "invalid",
-                "policy-domain": "example.com",
-            },
-            "summary": {
-                "total-successful-session-count": 0,
-                "total-failure-session-count": 0,
-            },
-        }
-        with self.assertRaises(parsedmarc.InvalidSMTPTLSReport):
-            parsedmarc._parse_smtp_tls_report_policy(policy)
-
-    def testParseSmtpTlsReportPolicyEmptyPolicyString(self):
-        """Empty policy-string list is not included"""
-        policy = {
-            "policy": {
-                "policy-type": "sts",
-                "policy-domain": "example.com",
-                "policy-string": [],
-                "mx-host-pattern": [],
-            },
-            "summary": {
-                "total-successful-session-count": 50,
-                "total-failure-session-count": 0,
-            },
-        }
-        result = parsedmarc._parse_smtp_tls_report_policy(policy)
-        self.assertNotIn("policy_strings", result)
-        self.assertNotIn("mx_host_patterns", result)
-
-    def testParseSmtpTlsReportPolicyWithFailureDetails(self):
-        """Policy with failure-details parses nested details"""
-        policy = {
-            "policy": {
-                "policy-type": "sts",
-                "policy-domain": "example.com",
-            },
-            "summary": {
-                "total-successful-session-count": 10,
-                "total-failure-session-count": 1,
-            },
-            "failure-details": [
-                {
-                    "result-type": "certificate-expired",
-                    "failed-session-count": 1,
-                }
-            ],
-        }
-        result = parsedmarc._parse_smtp_tls_report_policy(policy)
-        self.assertEqual(len(result["failure_details"]), 1)
-        self.assertEqual(
-            result["failure_details"][0]["result_type"], "certificate-expired"
-        )
-
-    def testParseSmtpTlsReportPolicyMissingField(self):
-        """Missing required policy field raises InvalidSMTPTLSReport"""
-        policy = {"policy": {"policy-type": "sts"}, "summary": {}}
-        with self.assertRaises(parsedmarc.InvalidSMTPTLSReport):
-            parsedmarc._parse_smtp_tls_report_policy(policy)
-
-    # ============================================================    # Tests for parse_smtp_tls_report_json
-    # ============================================================
-    def testParseSmtpTlsReportJsonValid(self):
-        """Valid SMTP TLS JSON report parses correctly"""
-        report = json.dumps(
-            {
-                "organization-name": "Example Corp",
-                "date-range": {
-                    "start-datetime": "2024-01-01T00:00:00Z",
-                    "end-datetime": "2024-01-02T00:00:00Z",
-                },
-                "contact-info": "admin@example.com",
-                "report-id": "report-123",
-                "policies": [
-                    {
-                        "policy": {
-                            "policy-type": "sts",
-                            "policy-domain": "example.com",
-                        },
-                        "summary": {
-                            "total-successful-session-count": 50,
-                            "total-failure-session-count": 0,
-                        },
-                    }
-                ],
-            }
-        )
-        result = parsedmarc.parse_smtp_tls_report_json(report)
-        self.assertEqual(result["organization_name"], "Example Corp")
-        self.assertEqual(result["report_id"], "report-123")
-        self.assertEqual(len(result["policies"]), 1)
-
-    def testParseSmtpTlsReportJsonBytes(self):
-        """SMTP TLS report as bytes parses correctly"""
-        report = json.dumps(
-            {
-                "organization-name": "Org",
-                "date-range": {
-                    "start-datetime": "2024-01-01",
-                    "end-datetime": "2024-01-02",
-                },
-                "contact-info": "a@b.com",
-                "report-id": "r1",
-                "policies": [
-                    {
-                        "policy": {"policy-type": "tlsa", "policy-domain": "a.com"},
-                        "summary": {
-                            "total-successful-session-count": 1,
-                            "total-failure-session-count": 0,
-                        },
-                    }
-                ],
-            }
-        ).encode("utf-8")
-        result = parsedmarc.parse_smtp_tls_report_json(report)
-        self.assertEqual(result["organization_name"], "Org")
-
-    def testParseSmtpTlsReportJsonMissingField(self):
-        """Missing required field raises InvalidSMTPTLSReport"""
-        report = json.dumps({"organization-name": "Org"})
-        with self.assertRaises(parsedmarc.InvalidSMTPTLSReport):
-            parsedmarc.parse_smtp_tls_report_json(report)
-
-    def testParseSmtpTlsReportJsonPoliciesNotList(self):
-        """Non-list policies raises InvalidSMTPTLSReport"""
-        report = json.dumps(
-            {
-                "organization-name": "Org",
-                "date-range": {
-                    "start-datetime": "2024-01-01",
-                    "end-datetime": "2024-01-02",
-                },
-                "contact-info": "a@b.com",
-                "report-id": "r1",
-                "policies": "not-a-list",
-            }
-        )
-        with self.assertRaises(parsedmarc.InvalidSMTPTLSReport):
-            parsedmarc.parse_smtp_tls_report_json(report)
-
-    # ============================================================    # Tests for aggregate report parsing (validation warnings, etc.)
-    # ============================================================
-    def testAggregateReportInvalidNpWarning(self):
-        """Invalid np value is preserved but logs warning"""
-        xml = """<?xml version="1.0"?>
-        <feedback>
-            <version>1.0</version>
-            <report_metadata>
-                <org_name>Test Org</org_name>
-                <email>test@example.com</email>
-                <report_id>test-np-invalid</report_id>
-                <date_range><begin>1704067200</begin><end>1704153599</end></date_range>
-            </report_metadata>
-            <policy_published>
-                <domain>example.com</domain>
-                <p>none</p>
-                <np>banana</np>
-                <testing>maybe</testing>
-                <discovery_method>magic</discovery_method>
-            </policy_published>
-            <record>
-                <row>
-                    <source_ip>192.0.2.1</source_ip>
-                    <count>1</count>
-                    <policy_evaluated>
-                        <disposition>none</disposition>
-                        <dkim>pass</dkim>
-                        <spf>pass</spf>
-                    </policy_evaluated>
-                </row>
-                <identifiers><header_from>example.com</header_from></identifiers>
-                <auth_results>
-                    <spf><domain>example.com</domain><result>pass</result></spf>
-                </auth_results>
-            </record>
-        </feedback>"""
-        report = parsedmarc.parse_aggregate_report_xml(xml, offline=True)
-        # Invalid values are still stored
-        self.assertEqual(report["policy_published"]["np"], "banana")
-        self.assertEqual(report["policy_published"]["testing"], "maybe")
-        self.assertEqual(report["policy_published"]["discovery_method"], "magic")
-
-    def testAggregateReportPassDisposition(self):
-        """'pass' as valid disposition is preserved"""
-        xml = """<?xml version="1.0"?>
-        <feedback>
-            <report_metadata>
-                <org_name>TestOrg</org_name>
-                <email>test@example.com</email>
-                <report_id>test-pass</report_id>
-                <date_range><begin>1704067200</begin><end>1704153599</end></date_range>
-            </report_metadata>
-            <policy_published>
-                <domain>example.com</domain>
-                <p>reject</p>
-            </policy_published>
-            <record>
-                <row>
-                    <source_ip>192.0.2.1</source_ip>
-                    <count>1</count>
-                    <policy_evaluated>
-                        <disposition>pass</disposition>
-                        <dkim>pass</dkim>
-                        <spf>pass</spf>
-                    </policy_evaluated>
-                </row>
-                <identifiers><header_from>example.com</header_from></identifiers>
-                <auth_results>
-                    <spf><domain>example.com</domain><result>pass</result></spf>
-                </auth_results>
-            </record>
-        </feedback>"""
-        report = parsedmarc.parse_aggregate_report_xml(xml, offline=True)
-        self.assertEqual(
-            report["records"][0]["policy_evaluated"]["disposition"], "pass"
-        )
-
-    def testAggregateReportMultipleRecords(self):
-        """Reports with multiple records are all parsed"""
-        xml = """<?xml version="1.0"?>
-        <feedback>
-            <report_metadata>
-                <org_name>TestOrg</org_name>
-                <email>test@example.com</email>
-                <report_id>test-multi</report_id>
-                <date_range><begin>1704067200</begin><end>1704153599</end></date_range>
-            </report_metadata>
-            <policy_published>
-                <domain>example.com</domain>
-                <p>none</p>
-            </policy_published>
-            <record>
-                <row>
-                    <source_ip>192.0.2.1</source_ip>
-                    <count>10</count>
-                    <policy_evaluated><disposition>none</disposition><dkim>pass</dkim><spf>pass</spf></policy_evaluated>
-                </row>
-                <identifiers><header_from>example.com</header_from></identifiers>
-                <auth_results><spf><domain>example.com</domain><result>pass</result></spf></auth_results>
-            </record>
-            <record>
-                <row>
-                    <source_ip>192.0.2.2</source_ip>
-                    <count>5</count>
-                    <policy_evaluated><disposition>quarantine</disposition><dkim>fail</dkim><spf>fail</spf></policy_evaluated>
-                </row>
-                <identifiers><header_from>example.com</header_from></identifiers>
-                <auth_results><spf><domain>example.com</domain><result>fail</result></spf></auth_results>
-            </record>
-        </feedback>"""
-        report = parsedmarc.parse_aggregate_report_xml(xml, offline=True)
-        self.assertEqual(len(report["records"]), 2)
-        self.assertEqual(report["records"][0]["count"], 10)
-        self.assertEqual(report["records"][1]["count"], 5)
-
-    def testAggregateReportInvalidXmlRecovery(self):
-        """Badly formed XML is recovered via lxml"""
-        xml = '<?xml version="1.0"?><feedback><report_metadata><org_name>Test</org_name><email>t@e.com</email><report_id>r1</report_id><date_range><begin>1704067200</begin><end>1704153599</end></date_range></report_metadata><policy_published><domain>example.com</domain><p>none</p></policy_published><record><row><source_ip>192.0.2.1</source_ip><count>1</count><policy_evaluated><disposition>none</disposition><dkim>pass</dkim><spf>pass</spf></policy_evaluated></row><identifiers><header_from>example.com</header_from></identifiers><auth_results><spf><domain>example.com</domain><result>pass</result></spf></auth_results></record></feedback>'
-        report = parsedmarc.parse_aggregate_report_xml(xml, offline=True)
-        self.assertEqual(report["report_metadata"]["report_id"], "r1")
-
-    def testAggregateReportCsvRowsContainRFC9990Fields(self):
-        """CSV rows include np, testing, discovery_method columns"""
-        result = parsedmarc.parse_report_file(
-            "samples/aggregate/rfc9990-sample.xml",
-            always_use_local_files=True,
-            offline=True,
-        )
-        report = cast(AggregateReport, result["report"])
-        rows = parsedmarc.parsed_aggregate_reports_to_csv_rows(report)
-        self.assertTrue(len(rows) > 0)
-        row = rows[0]
-        self.assertIn("np", row)
-        self.assertIn("testing", row)
-        self.assertIn("discovery_method", row)
-        self.assertIn("source_ip_address", row)
-        self.assertIn("dkim_domains", row)
-        self.assertIn("spf_domains", row)
-
-    def testAggregateReportSchemaVersion(self):
-        """RFC 9990 report with <version> returns correct xml_schema"""
-        xml = """<?xml version="1.0"?>
-        <feedback>
-            <version>1.0</version>
-            <report_metadata>
-                <org_name>TestOrg</org_name>
-                <email>test@example.com</email>
-                <report_id>test-version</report_id>
-                <date_range><begin>1704067200</begin><end>1704153599</end></date_range>
-            </report_metadata>
-            <policy_published>
-                <domain>example.com</domain>
-                <p>none</p>
-            </policy_published>
-            <record>
-                <row>
-                    <source_ip>192.0.2.1</source_ip>
-                    <count>1</count>
-                    <policy_evaluated><disposition>none</disposition><dkim>pass</dkim><spf>pass</spf></policy_evaluated>
-                </row>
-                <identifiers><header_from>example.com</header_from></identifiers>
-                <auth_results><spf><domain>example.com</domain><result>pass</result></spf></auth_results>
-            </record>
-        </feedback>"""
-        report = parsedmarc.parse_aggregate_report_xml(xml, offline=True)
-        self.assertEqual(report["xml_schema"], "1.0")
-
-    def testAggregateReportDraftSchema(self):
-        """Report without <version> defaults to 'draft' schema"""
-        xml = """<?xml version="1.0"?>
-        <feedback>
-            <report_metadata>
-                <org_name>TestOrg</org_name>
-                <email>test@example.com</email>
-                <report_id>test-draft</report_id>
-                <date_range><begin>1704067200</begin><end>1704153599</end></date_range>
-            </report_metadata>
-            <policy_published>
-                <domain>example.com</domain>
-                <p>none</p>
-            </policy_published>
-            <record>
-                <row>
-                    <source_ip>192.0.2.1</source_ip>
-                    <count>1</count>
-                    <policy_evaluated><disposition>none</disposition><dkim>pass</dkim><spf>pass</spf></policy_evaluated>
-                </row>
-                <identifiers><header_from>example.com</header_from></identifiers>
-                <auth_results><spf><domain>example.com</domain><result>pass</result></spf></auth_results>
-            </record>
-        </feedback>"""
-        report = parsedmarc.parse_aggregate_report_xml(xml, offline=True)
-        self.assertEqual(report["xml_schema"], "draft")
-
-    def testAggregateReportGeneratorField(self):
-        """Generator field is correctly extracted"""
-        xml = """<?xml version="1.0"?>
-        <feedback>
-            <report_metadata>
-                <org_name>TestOrg</org_name>
-                <email>test@example.com</email>
-                <report_id>test-gen</report_id>
-                <generator>My Reporter v1.0</generator>
-                <date_range><begin>1704067200</begin><end>1704153599</end></date_range>
-            </report_metadata>
-            <policy_published>
-                <domain>example.com</domain>
-                <p>none</p>
-            </policy_published>
-            <record>
-                <row>
-                    <source_ip>192.0.2.1</source_ip>
-                    <count>1</count>
-                    <policy_evaluated><disposition>none</disposition><dkim>pass</dkim><spf>pass</spf></policy_evaluated>
-                </row>
-                <identifiers><header_from>example.com</header_from></identifiers>
-                <auth_results><spf><domain>example.com</domain><result>pass</result></spf></auth_results>
-            </record>
-        </feedback>"""
-        report = parsedmarc.parse_aggregate_report_xml(xml, offline=True)
-        self.assertEqual(report["report_metadata"]["generator"], "My Reporter v1.0")
-
-    def testAggregateReportReportErrors(self):
-        """Report errors in metadata are captured"""
-        xml = """<?xml version="1.0"?>
-        <feedback>
-            <report_metadata>
-                <org_name>TestOrg</org_name>
-                <email>test@example.com</email>
-                <report_id>test-err</report_id>
-                <error>Some error</error>
-                <date_range><begin>1704067200</begin><end>1704153599</end></date_range>
-            </report_metadata>
-            <policy_published>
-                <domain>example.com</domain>
-                <p>none</p>
-            </policy_published>
-            <record>
-                <row>
-                    <source_ip>192.0.2.1</source_ip>
-                    <count>1</count>
-                    <policy_evaluated><disposition>none</disposition><dkim>pass</dkim><spf>pass</spf></policy_evaluated>
-                </row>
-                <identifiers><header_from>example.com</header_from></identifiers>
-                <auth_results><spf><domain>example.com</domain><result>pass</result></spf></auth_results>
-            </record>
-        </feedback>"""
-        report = parsedmarc.parse_aggregate_report_xml(xml, offline=True)
-        self.assertIn("Some error", report["report_metadata"]["errors"])
-
-    def testAggregateReportPolicyDefaults(self):
-        """Policy defaults: adkim/aspf='r', sp=p, pct/fo=None"""
-        xml = """<?xml version="1.0"?>
-        <feedback>
-            <report_metadata>
-                <org_name>TestOrg</org_name>
-                <email>test@example.com</email>
-                <report_id>test-defaults</report_id>
-                <date_range><begin>1704067200</begin><end>1704153599</end></date_range>
-            </report_metadata>
-            <policy_published>
-                <domain>example.com</domain>
-                <p>reject</p>
-            </policy_published>
-            <record>
-                <row>
-                    <source_ip>192.0.2.1</source_ip>
-                    <count>1</count>
-                    <policy_evaluated><disposition>none</disposition><dkim>pass</dkim><spf>pass</spf></policy_evaluated>
-                </row>
-                <identifiers><header_from>example.com</header_from></identifiers>
-                <auth_results><spf><domain>example.com</domain><result>pass</result></spf></auth_results>
-            </record>
-        </feedback>"""
-        report = parsedmarc.parse_aggregate_report_xml(xml, offline=True)
-        pp = report["policy_published"]
-        self.assertEqual(pp["adkim"], "r")
-        self.assertEqual(pp["aspf"], "r")
-        self.assertEqual(pp["sp"], "reject")  # defaults to p
-        self.assertIsNone(pp["pct"])
-        self.assertIsNone(pp["fo"])
-        self.assertIsNone(pp["np"])
-        self.assertIsNone(pp["testing"])
-        self.assertIsNone(pp["discovery_method"])
-
-    def testMagicXmlTagDetection(self):
-        """XML without declaration (starting with '<') is extracted"""
-        xml_no_decl = b"<feedback><report_metadata><org_name>T</org_name><email>a@b.com</email><report_id>r1</report_id><date_range><begin>1704067200</begin><end>1704153599</end></date_range></report_metadata><policy_published><domain>example.com</domain><p>none</p></policy_published><record><row><source_ip>192.0.2.1</source_ip><count>1</count><policy_evaluated><disposition>none</disposition><dkim>pass</dkim><spf>pass</spf></policy_evaluated></row><identifiers><header_from>example.com</header_from></identifiers><auth_results><spf><domain>example.com</domain><result>pass</result></spf></auth_results></record></feedback>"
-        self.assertTrue(xml_no_decl.startswith(parsedmarc.MAGIC_XML_TAG))
-        # Ensure it extracts as XML
-        result = parsedmarc.extract_report(xml_no_decl)
-        self.assertIn("<feedback>", result)
-
-    # ============================================================    # Tests for parsedmarc/utils.py
-    # ============================================================
-    def testTimestampToDatetime(self):
-        """timestamp_to_datetime converts UNIX timestamp to datetime"""
-        from datetime import datetime
-
-        ts = 1704067200
-        dt = parsedmarc.utils.timestamp_to_datetime(ts)
-        self.assertIsInstance(dt, datetime)
-        # Should match stdlib fromtimestamp (local time)
-        self.assertEqual(dt, datetime.fromtimestamp(ts))
-
-    def testTimestampToHuman(self):
-        """timestamp_to_human returns formatted string"""
-        result = parsedmarc.utils.timestamp_to_human(1704067200)
-        self.assertRegex(result, r"\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}")
-
-    def testHumanTimestampToDatetime(self):
-        """human_timestamp_to_datetime parses timestamp string"""
-        dt = parsedmarc.utils.human_timestamp_to_datetime("2024-01-01 00:00:00")
-        self.assertIsInstance(dt, datetime)
-        self.assertEqual(dt.year, 2024)
-        self.assertEqual(dt.month, 1)
-        self.assertEqual(dt.day, 1)
-
-    def testHumanTimestampToDatetimeUtc(self):
-        """human_timestamp_to_datetime with to_utc=True returns UTC"""
-        dt = parsedmarc.utils.human_timestamp_to_datetime(
-            "2024-01-01 12:00:00", to_utc=True
-        )
-        self.assertEqual(dt.tzinfo, timezone.utc)
-
-    def testHumanTimestampToDatetimeParenthesisStripping(self):
-        """Parenthesized content is stripped from timestamps"""
-        dt = parsedmarc.utils.human_timestamp_to_datetime(
-            "Mon, 01 Jan 2024 00:00:00 +0000 (UTC)"
-        )
-        self.assertEqual(dt.year, 2024)
-
-    def testHumanTimestampToDatetimeNegativeZero(self):
-        """-0000 timezone is handled"""
-        dt = parsedmarc.utils.human_timestamp_to_datetime("2024-01-01 00:00:00 -0000")
-        self.assertEqual(dt.year, 2024)
-
-    def testHumanTimestampToUnixTimestamp(self):
-        """human_timestamp_to_unix_timestamp converts to int"""
-        ts = parsedmarc.utils.human_timestamp_to_unix_timestamp("2024-01-01 00:00:00")
-        self.assertIsInstance(ts, int)
-
-    def testHumanTimestampToUnixTimestampWithT(self):
-        """T separator in timestamp is handled"""
-        ts = parsedmarc.utils.human_timestamp_to_unix_timestamp("2024-01-01T00:00:00")
-        self.assertIsInstance(ts, int)
-
-    def testGetIpAddressCountry(self):
-        """get_ip_address_country returns country code using bundled DBIP"""
-        # 8.8.8.8 is a well-known Google DNS IP in US
-        country = parsedmarc.utils.get_ip_address_country("8.8.8.8")
-        self.assertEqual(country, "US")
-
-    def testGetIpAddressCountryNotFound(self):
-        """get_ip_address_country returns None for reserved IP"""
-        country = parsedmarc.utils.get_ip_address_country("127.0.0.1")
-        self.assertIsNone(country)
-
-    def testGetServiceFromReverseDnsBaseDomainOffline(self):
-        """get_service_from_reverse_dns_base_domain in offline mode"""
-        result = parsedmarc.utils.get_service_from_reverse_dns_base_domain(
-            "google.com", offline=True
-        )
-        self.assertIn("Google", result["name"])
-        self.assertIsNotNone(result["type"])
-
-    def testGetServiceFromReverseDnsBaseDomainUnknown(self):
-        """Unknown base domain returns domain as name and None as type"""
-        result = parsedmarc.utils.get_service_from_reverse_dns_base_domain(
-            "unknown-domain-xyz.example", offline=True
-        )
-        self.assertEqual(result["name"], "unknown-domain-xyz.example")
-        self.assertIsNone(result["type"])
-
-    def testGetIpAddressInfoOffline(self):
-        """get_ip_address_info in offline mode returns country but no DNS"""
-        info = parsedmarc.utils.get_ip_address_info("8.8.8.8", offline=True)
-        self.assertEqual(info["ip_address"], "8.8.8.8")
-        self.assertEqual(info["country"], "US")
-        self.assertIsNone(info["reverse_dns"])
-
-    def testGetIpAddressInfoCache(self):
-        """get_ip_address_info uses cache on second call"""
-        from expiringdict import ExpiringDict
-
-        cache = ExpiringDict(max_len=100, max_age_seconds=60)
-        with patch("parsedmarc.utils.get_reverse_dns", return_value="dns.google"):
-            info1 = parsedmarc.utils.get_ip_address_info(
-                "8.8.8.8",
-                offline=False,
-                cache=cache,
-                always_use_local_files=True,
-            )
-        self.assertIn("8.8.8.8", cache)
-        info2 = parsedmarc.utils.get_ip_address_info(
-            "8.8.8.8", offline=False, cache=cache
-        )
-        self.assertEqual(info1["ip_address"], info2["ip_address"])
-        self.assertEqual(info2["reverse_dns"], "dns.google")
-
-    def testParseEmailAddressWithDisplayName(self):
-        """parse_email_address with display name"""
-        result = parsedmarc.utils.parse_email_address(("John Doe", "john@example.com"))  # type: ignore[arg-type]
-        self.assertEqual(result["display_name"], "John Doe")
-        self.assertEqual(result["address"], "john@example.com")
-        self.assertEqual(result["local"], "john")
-        self.assertEqual(result["domain"], "example.com")
-
-    def testParseEmailAddressWithoutDisplayName(self):
-        """parse_email_address with empty display name"""
-        result = parsedmarc.utils.parse_email_address(("", "john@example.com"))  # type: ignore[arg-type]
-        self.assertIsNone(result["display_name"])
-        self.assertEqual(result["address"], "john@example.com")
-
-    def testParseEmailAddressNoAt(self):
-        """parse_email_address with no @ returns None local/domain"""
-        result = parsedmarc.utils.parse_email_address(("", "localonly"))  # type: ignore[arg-type]
-        self.assertIsNone(result["local"])
-        self.assertIsNone(result["domain"])
-
-    def testGetFilenameSafeString(self):
-        """get_filename_safe_string removes invalid chars"""
-        result = parsedmarc.utils.get_filename_safe_string('file/name:with"bad*chars')
-        self.assertNotIn("/", result)
-        self.assertNotIn(":", result)
-        self.assertNotIn('"', result)
-        self.assertNotIn("*", result)
-
-    def testGetFilenameSafeStringNone(self):
-        """get_filename_safe_string with None returns 'None'"""
-        result = parsedmarc.utils.get_filename_safe_string(None)  # type: ignore[arg-type]
-        self.assertEqual(result, "None")
-
-    def testGetFilenameSafeStringLong(self):
-        """get_filename_safe_string truncates to 100 chars"""
-        result = parsedmarc.utils.get_filename_safe_string("a" * 200)
-        self.assertEqual(len(result), 100)
-
-    def testGetFilenameSafeStringTrailingDot(self):
-        """get_filename_safe_string strips trailing dots"""
-        result = parsedmarc.utils.get_filename_safe_string("filename...")
-        self.assertFalse(result.endswith("."))
-
-    def testIsMboxNonMbox(self):
-        """is_mbox returns False for non-mbox file"""
-        result = parsedmarc.utils.is_mbox("samples/empty.xml")
-        self.assertFalse(result)
-
-    def testIsOutlookMsgNonMsg(self):
-        """is_outlook_msg returns False for non-MSG content"""
-        self.assertFalse(parsedmarc.utils.is_outlook_msg(b"not an outlook msg"))
-        self.assertFalse(parsedmarc.utils.is_outlook_msg("string content"))
-
-    def testIsOutlookMsgMagic(self):
-        """is_outlook_msg returns True for correct magic bytes"""
-        magic = b"\xd0\xcf\x11\xe0\xa1\xb1\x1a\xe1" + b"\x00" * 100
-        self.assertTrue(parsedmarc.utils.is_outlook_msg(magic))
-
-    # ============================================================    # Tests for output modules (mocked)
-    # ============================================================
-    def testWebhookClientInit(self):
-        """WebhookClient initializes with correct attributes"""
-        from parsedmarc.webhook import WebhookClient
-
-        client = WebhookClient(
-            aggregate_url="http://agg.example.com",
-            failure_url="http://fail.example.com",
-            smtp_tls_url="http://tls.example.com",
-        )
-        self.assertEqual(client.aggregate_url, "http://agg.example.com")
-        self.assertEqual(client.failure_url, "http://fail.example.com")
-        self.assertEqual(client.smtp_tls_url, "http://tls.example.com")
-        self.assertEqual(client.timeout, 60)
-
-    def testWebhookClientSaveMethods(self):
-        """WebhookClient save methods call _send_to_webhook"""
-        from parsedmarc.webhook import WebhookClient
-
-        client = WebhookClient("http://a", "http://f", "http://t")
-        client.session = MagicMock()
-        client.save_aggregate_report_to_webhook('{"test": 1}')
-        client.session.post.assert_called_with(
-            "http://a", data='{"test": 1}', timeout=60
-        )
-        client.save_failure_report_to_webhook('{"fail": 1}')
-        client.session.post.assert_called_with(
-            "http://f", data='{"fail": 1}', timeout=60
-        )
-        client.save_smtp_tls_report_to_webhook('{"tls": 1}')
-        client.session.post.assert_called_with(
-            "http://t", data='{"tls": 1}', timeout=60
-        )
-
-    def testWebhookBackwardCompatAlias(self):
-        """WebhookClient forensic alias points to failure method"""
-        from parsedmarc.webhook import WebhookClient
-
-        self.assertIs(
-            WebhookClient.save_forensic_report_to_webhook,  # type: ignore[attr-defined]
-            WebhookClient.save_failure_report_to_webhook,
-        )
-
-    def testKafkaStripMetadata(self):
-        """KafkaClient.strip_metadata extracts metadata to root"""
-        from parsedmarc.kafkaclient import KafkaClient
-
-        report = {
-            "report_metadata": {
-                "org_name": "TestOrg",
-                "org_email": "test@example.com",
-                "report_id": "r-123",
-                "begin_date": "2024-01-01",
-                "end_date": "2024-01-02",
-            },
-            "records": [],
-        }
-        result = KafkaClient.strip_metadata(report)
-        self.assertEqual(result["org_name"], "TestOrg")
-        self.assertEqual(result["org_email"], "test@example.com")
-        self.assertEqual(result["report_id"], "r-123")
-        self.assertNotIn("report_metadata", result)
-
-    def testKafkaGenerateDateRange(self):
-        """KafkaClient.generate_date_range generates date range list"""
-        from parsedmarc.kafkaclient import KafkaClient
-
-        report = {
-            "report_metadata": {
-                "begin_date": "2024-01-01 00:00:00",
-                "end_date": "2024-01-02 00:00:00",
-            }
-        }
-        result = KafkaClient.generate_date_range(report)
-        self.assertEqual(len(result), 2)
-        self.assertIn("2024-01-01", result[0])
-        self.assertIn("2024-01-02", result[1])
-
-    def testSplunkHECClientInit(self):
-        """HECClient initializes with correct URL and headers"""
-        from parsedmarc.splunk import HECClient
-
-        client = HECClient(
-            url="https://splunk.example.com:8088",
-            access_token="my-token",
-            index="main",
-        )
-        self.assertIn("/services/collector/event/1.0", client.url)
-        self.assertEqual(client.access_token, "my-token")
-        self.assertEqual(client.index, "main")
-        self.assertEqual(client.source, "parsedmarc")
-        self.assertIn("Splunk my-token", client.session.headers["Authorization"])
-
-    def testSplunkHECClientStripTokenPrefix(self):
-        """HECClient strips 'Splunk ' prefix from token"""
-        from parsedmarc.splunk import HECClient
-
-        client = HECClient(
-            url="https://splunk.example.com",
-            access_token="Splunk my-token",
-            index="main",
-        )
-        self.assertEqual(client.access_token, "my-token")
-
-    def testSplunkBackwardCompatAlias(self):
-        """HECClient forensic alias points to failure method"""
-        from parsedmarc.splunk import HECClient
-
-        self.assertIs(
-            HECClient.save_forensic_reports_to_splunk,  # type: ignore[attr-defined]
-            HECClient.save_failure_reports_to_splunk,
-        )
-
-    def testSyslogClientUdpInit(self):
-        """SyslogClient creates UDP handler"""
-        from parsedmarc.syslog import SyslogClient
-
-        client = SyslogClient("localhost", 514, protocol="udp")
-        self.assertEqual(client.server_name, "localhost")
-        self.assertEqual(client.server_port, 514)
-        self.assertEqual(client.protocol, "udp")
-
-    def testSyslogClientInvalidProtocol(self):
-        """SyslogClient with invalid protocol raises ValueError"""
-        from parsedmarc.syslog import SyslogClient
-
-        with self.assertRaises(ValueError):
-            SyslogClient("localhost", 514, protocol="invalid")
-
-    def testSyslogBackwardCompatAlias(self):
-        """SyslogClient forensic alias points to failure method"""
-        from parsedmarc.syslog import SyslogClient
-
-        self.assertIs(
-            SyslogClient.save_forensic_report_to_syslog,  # type: ignore[attr-defined]
-            SyslogClient.save_failure_report_to_syslog,
-        )
-
-    def testLogAnalyticsConfig(self):
-        """LogAnalyticsConfig stores all fields"""
-        from parsedmarc.loganalytics import LogAnalyticsConfig
-
-        config = LogAnalyticsConfig(
-            client_id="cid",
-            client_secret="csec",
-            tenant_id="tid",
-            dce="https://dce.example.com",
-            dcr_immutable_id="dcr-123",
-            dcr_aggregate_stream="agg-stream",
-            dcr_failure_stream="fail-stream",
-            dcr_smtp_tls_stream="tls-stream",
-        )
-        self.assertEqual(config.client_id, "cid")
-        self.assertEqual(config.client_secret, "csec")
-        self.assertEqual(config.tenant_id, "tid")
-        self.assertEqual(config.dce, "https://dce.example.com")
-        self.assertEqual(config.dcr_immutable_id, "dcr-123")
-        self.assertEqual(config.dcr_aggregate_stream, "agg-stream")
-        self.assertEqual(config.dcr_failure_stream, "fail-stream")
-        self.assertEqual(config.dcr_smtp_tls_stream, "tls-stream")
-
-    def testLogAnalyticsClientValidationError(self):
-        """LogAnalyticsClient raises on missing required config"""
-        from parsedmarc.loganalytics import LogAnalyticsClient, LogAnalyticsException
-
-        with self.assertRaises(LogAnalyticsException):
-            LogAnalyticsClient(
-                client_id="",
-                client_secret="csec",
-                tenant_id="tid",
-                dce="https://dce.example.com",
-                dcr_immutable_id="dcr-123",
-                dcr_aggregate_stream="agg",
-                dcr_failure_stream="fail",
-                dcr_smtp_tls_stream="tls",
-            )
-
-    def testSmtpTlsCsvRows(self):
-        """parsed_smtp_tls_reports_to_csv_rows produces correct rows"""
-        report_json = json.dumps(
-            {
-                "organization-name": "Org",
-                "date-range": {
-                    "start-datetime": "2024-01-01T00:00:00Z",
-                    "end-datetime": "2024-01-02T00:00:00Z",
-                },
-                "contact-info": "a@b.com",
-                "report-id": "r1",
-                "policies": [
-                    {
-                        "policy": {
-                            "policy-type": "sts",
-                            "policy-domain": "example.com",
-                            "policy-string": ["v: STSv1"],
-                            "mx-host-pattern": ["*.example.com"],
-                        },
-                        "summary": {
-                            "total-successful-session-count": 10,
-                            "total-failure-session-count": 1,
-                        },
-                        "failure-details": [
-                            {"result-type": "cert-expired", "failed-session-count": 1}
-                        ],
-                    }
-                ],
-            }
-        )
-        parsed = parsedmarc.parse_smtp_tls_report_json(report_json)
-        rows = parsedmarc.parsed_smtp_tls_reports_to_csv_rows(parsed)
-        self.assertTrue(len(rows) >= 2)
-        self.assertEqual(rows[0]["organization_name"], "Org")
-        self.assertEqual(rows[0]["policy_domain"], "example.com")
-
-    def testParsedAggregateReportsToCsvRowsList(self):
-        """parsed_aggregate_reports_to_csv_rows handles list of reports"""
-        result = parsedmarc.parse_report_file(
-            "samples/aggregate/rfc9990-sample.xml",
-            always_use_local_files=True,
-            offline=True,
-        )
-        report = cast(AggregateReport, result["report"])
-        # Pass as a list
-        rows = parsedmarc.parsed_aggregate_reports_to_csv_rows([report])
-        self.assertTrue(len(rows) > 0)
-        # Verify non-str/int/bool values are cleaned
-        for row in rows:
-            for v in row.values():
-                self.assertIn(type(v), [str, int, bool])
-
-    def testExceptionHierarchy(self):
-        """Exception class hierarchy is correct"""
-        self.assertTrue(issubclass(parsedmarc.ParserError, RuntimeError))
-        self.assertTrue(
-            issubclass(parsedmarc.InvalidDMARCReport, parsedmarc.ParserError)
-        )
-        self.assertTrue(
-            issubclass(parsedmarc.InvalidAggregateReport, parsedmarc.InvalidDMARCReport)
-        )
-        self.assertTrue(
-            issubclass(parsedmarc.InvalidFailureReport, parsedmarc.InvalidDMARCReport)
-        )
-        self.assertTrue(
-            issubclass(parsedmarc.InvalidSMTPTLSReport, parsedmarc.ParserError)
-        )
-        self.assertIs(parsedmarc.InvalidForensicReport, parsedmarc.InvalidFailureReport)
-
-    def testAggregateReportNormalization(self):
-        """Reports spanning >24h get normalized per day"""
-        xml = """<?xml version="1.0"?>
-        <feedback>
-            <report_metadata>
-                <org_name>TestOrg</org_name>
-                <email>test@example.com</email>
-                <report_id>test-norm</report_id>
-                <date_range><begin>1704067200</begin><end>1704326400</end></date_range>
-            </report_metadata>
-            <policy_published>
-                <domain>example.com</domain>
-                <p>none</p>
-            </policy_published>
-            <record>
-                <row>
-                    <source_ip>192.0.2.1</source_ip>
-                    <count>90</count>
-                    <policy_evaluated><disposition>none</disposition><dkim>pass</dkim><spf>pass</spf></policy_evaluated>
-                </row>
-                <identifiers><header_from>example.com</header_from></identifiers>
-                <auth_results><spf><domain>example.com</domain><result>pass</result></spf></auth_results>
-            </record>
-        </feedback>"""
-        # Span is 259200 seconds (3 days), exceeds default 24h threshold
-        report = parsedmarc.parse_aggregate_report_xml(xml, offline=True)
-        self.assertTrue(report["report_metadata"]["timespan_requires_normalization"])
-        # Records should be split across days
-        self.assertTrue(len(report["records"]) > 1)
-        total = sum(r["count"] for r in report["records"])
-        self.assertEqual(total, 90)
-        for r in report["records"]:
-            self.assertTrue(r["normalized_timespan"])  # type: ignore[typeddict-item]
-
-    # ===================================================================
-    # Additional backward compatibility alias tests
-    # ===================================================================
-
-    def testGelfBackwardCompatAlias(self):
-        """GelfClient forensic alias points to failure method"""
-        from parsedmarc.gelf import GelfClient
-
-        self.assertIs(
-            GelfClient.save_forensic_report_to_gelf,  # type: ignore[attr-defined]
-            GelfClient.save_failure_report_to_gelf,
-        )
-
-    def testS3BackwardCompatAlias(self):
-        """S3Client forensic alias points to failure method"""
-        from parsedmarc.s3 import S3Client
-
-        self.assertIs(
-            S3Client.save_forensic_report_to_s3,  # type: ignore[attr-defined]
-            S3Client.save_failure_report_to_s3,
-        )
-
-    def testKafkaBackwardCompatAlias(self):
-        """KafkaClient forensic alias points to failure method"""
-        from parsedmarc.kafkaclient import KafkaClient
-
-        self.assertIs(
-            KafkaClient.save_forensic_reports_to_kafka,  # type: ignore[attr-defined]
-            KafkaClient.save_failure_reports_to_kafka,
-        )
-
-    # ===================================================================
-    # Additional extract/parse tests
-    # ===================================================================
-
-    def testExtractReportFromFilePathNotFound(self):
-        """extract_report_from_file_path raises ParserError for missing file"""
-        with self.assertRaises(parsedmarc.ParserError):
-            parsedmarc.extract_report_from_file_path("nonexistent_file.xml")
-
-    def testExtractReportInvalidArchive(self):
-        """extract_report raises ParserError for unrecognized binary content"""
-        with self.assertRaises(parsedmarc.ParserError):
-            parsedmarc.extract_report(b"\x00\x01\x02\x03\x04\x05\x06\x07")
-
-    def testParseAggregateReportFile(self):
-        """parse_aggregate_report_file parses bytes input directly"""
-        print()
-        sample_path = "samples/aggregate/rfc9990-sample.xml"
-        print("Testing {0}: ".format(sample_path), end="")
-        with open(sample_path, "rb") as f:
-            data = f.read()
-        report = parsedmarc.parse_aggregate_report_file(
-            data,
-            offline=True,
-            always_use_local_files=True,
-        )
-        self.assertEqual(report["report_metadata"]["org_name"], "Sample Reporter")
-        self.assertEqual(report["policy_published"]["domain"], "example.com")
-        print("Passed!")
-
-    def testParseInvalidAggregateSample(self):
-        """Test invalid aggregate samples are handled"""
-        print()
-        sample_paths = glob("samples/aggregate_invalid/*")
-        for sample_path in sample_paths:
-            if os.path.isdir(sample_path):
-                continue
-            print("Testing {0}: ".format(sample_path), end="")
-            with self.subTest(sample=sample_path):
-                parsed_report = cast(
-                    AggregateReport,
-                    parsedmarc.parse_report_file(
-                        sample_path, always_use_local_files=True, offline=OFFLINE_MODE
-                    )["report"],
-                )
-                parsedmarc.parsed_aggregate_reports_to_csv(parsed_report)
-            print("Passed!")
-
-    def testParseReportFileWithBytes(self):
-        """parse_report_file handles bytes input"""
-        with open("samples/aggregate/rfc9990-sample.xml", "rb") as f:
-            data = f.read()
-        result = parsedmarc.parse_report_file(
-            data, always_use_local_files=True, offline=True
-        )
-        self.assertEqual(result["report_type"], "aggregate")
-
-    def testFailureReportCsvRoundtrip(self):
-        """Failure report CSV generation works on sample reports"""
-        print()
-        sample_paths = glob("samples/failure/*.eml")
-        for sample_path in sample_paths:
-            print("Testing CSV for {0}: ".format(sample_path), end="")
-            with self.subTest(sample=sample_path):
-                parsed_report = cast(
-                    FailureReport,
-                    parsedmarc.parse_report_file(sample_path, offline=OFFLINE_MODE)[
-                        "report"
-                    ],
-                )
-                csv_output = parsedmarc.parsed_failure_reports_to_csv(parsed_report)
-                self.assertIsNotNone(csv_output)
-                self.assertIn(",", csv_output)
-                rows = parsedmarc.parsed_failure_reports_to_csv_rows(parsed_report)
-                self.assertTrue(len(rows) > 0)
-            print("Passed!")
-
-
-class TestLoadPSLOverrides(unittest.TestCase):
-    """Covers `parsedmarc.utils.load_psl_overrides`."""
-
-    def setUp(self):
-        # Snapshot the module-level list so each test leaves it as it found it.
-        self._saved = list(parsedmarc.utils.psl_overrides)
-
-    def tearDown(self):
-        parsedmarc.utils.psl_overrides.clear()
-        parsedmarc.utils.psl_overrides.extend(self._saved)
-
-    def test_offline_loads_bundled_file(self):
-        """offline=True populates the list from the bundled file, no network."""
-        result = parsedmarc.utils.load_psl_overrides(offline=True)
-        self.assertIs(result, parsedmarc.utils.psl_overrides)
-        self.assertGreater(len(result), 0)
-        # The bundled file is expected to contain at least one well-known entry.
-        self.assertIn(".linode.com", result)
-
-    def test_local_file_path_overrides_bundled(self):
-        """A custom local_file_path takes precedence over the bundled copy."""
-        with tempfile.NamedTemporaryFile(
-            "w", suffix=".txt", delete=False, encoding="utf-8"
-        ) as tf:
-            tf.write("-custom-brand.com\n.another-brand.net\n\n   \n")
-            path = tf.name
-        try:
-            result = parsedmarc.utils.load_psl_overrides(
-                offline=True, local_file_path=path
-            )
-            self.assertEqual(result, ["-custom-brand.com", ".another-brand.net"])
-        finally:
-            os.unlink(path)
-
-    def test_clear_before_reload(self):
-        """Re-running load_psl_overrides replaces the list, not appends."""
-        parsedmarc.utils.psl_overrides.clear()
-        parsedmarc.utils.psl_overrides.append(".stale-entry.com")
-        parsedmarc.utils.load_psl_overrides(offline=True)
-        self.assertNotIn(".stale-entry.com", parsedmarc.utils.psl_overrides)
-
-    def test_url_success(self):
-        """A 200 response from the URL populates the list."""
-        fake_body = "-fetched-brand.com\n.cdn-fetched.net\n"
-        mock_response = MagicMock()
-        mock_response.text = fake_body
-        mock_response.raise_for_status = MagicMock()
-        with patch(
-            "parsedmarc.utils.requests.get", return_value=mock_response
-        ) as mock_get:
-            result = parsedmarc.utils.load_psl_overrides(url="https://example.test/ov")
-            self.assertEqual(result, ["-fetched-brand.com", ".cdn-fetched.net"])
-            mock_get.assert_called_once()
-
-    def test_url_failure_falls_back_to_local(self):
-        """A network error falls back to the bundled copy."""
-        import requests
-
-        with patch(
-            "parsedmarc.utils.requests.get",
-            side_effect=requests.exceptions.ConnectionError("nope"),
-        ):
-            result = parsedmarc.utils.load_psl_overrides(url="https://example.test/ov")
-        # Bundled file still loaded.
-        self.assertGreater(len(result), 0)
-        self.assertIn(".linode.com", result)
-
-    def test_always_use_local_skips_network(self):
-        """always_use_local_file=True must not call requests.get."""
-        with patch("parsedmarc.utils.requests.get") as mock_get:
-            parsedmarc.utils.load_psl_overrides(always_use_local_file=True)
-            mock_get.assert_not_called()
-
-
-class TestLoadReverseDnsMapReloadsPSLOverrides(unittest.TestCase):
-    """`load_reverse_dns_map` must reload `psl_overrides.txt` in the same call
-    so map entries that depend on folded bases resolve correctly."""
-
-    def setUp(self):
-        self._saved = list(parsedmarc.utils.psl_overrides)
-
-    def tearDown(self):
-        parsedmarc.utils.psl_overrides.clear()
-        parsedmarc.utils.psl_overrides.extend(self._saved)
-
-    def test_map_load_triggers_psl_reload(self):
-        """Calling load_reverse_dns_map offline also invokes load_psl_overrides
-        with matching flags, and the overrides list is repopulated."""
-        rdm = {}
-        parsedmarc.utils.psl_overrides.clear()
-        parsedmarc.utils.psl_overrides.append(".stale-from-before.com")
-        with patch(
-            "parsedmarc.utils.load_psl_overrides",
-            wraps=parsedmarc.utils.load_psl_overrides,
-        ) as spy:
-            parsedmarc.utils.load_reverse_dns_map(rdm, offline=True)
-        spy.assert_called_once()
-        kwargs = spy.call_args.kwargs
-        self.assertTrue(kwargs["offline"])
-        self.assertIsNone(kwargs["url"])
-        self.assertIsNone(kwargs["local_file_path"])
-        self.assertNotIn(".stale-from-before.com", parsedmarc.utils.psl_overrides)
-
-    def test_map_load_forwards_psl_overrides_kwargs(self):
-        """psl_overrides_path / psl_overrides_url are forwarded verbatim."""
-        rdm = {}
-        with patch("parsedmarc.utils.load_psl_overrides") as spy:
-            parsedmarc.utils.load_reverse_dns_map(
-                rdm,
-                offline=True,
-                always_use_local_file=True,
-                psl_overrides_path="/tmp/custom.txt",
-                psl_overrides_url="https://example.test/ov",
-            )
-        spy.assert_called_once_with(
-            always_use_local_file=True,
-            local_file_path="/tmp/custom.txt",
-            url="https://example.test/ov",
-            offline=True,
-        )
-
-
-class TestGetBaseDomainWithOverrides(unittest.TestCase):
-    """`get_base_domain` must honour the current psl_overrides list."""
-
-    def setUp(self):
-        self._saved = list(parsedmarc.utils.psl_overrides)
-        parsedmarc.utils.psl_overrides.clear()
-        parsedmarc.utils.psl_overrides.extend([".cprapid.com", "-nobre.com.br"])
-
-    def tearDown(self):
-        parsedmarc.utils.psl_overrides.clear()
-        parsedmarc.utils.psl_overrides.extend(self._saved)
-
-    def test_dot_prefixed_override_folds_subdomain(self):
-        result = parsedmarc.utils.get_base_domain("74-208-244-234.cprapid.com")
-        self.assertEqual(result, "cprapid.com")
-
-    def test_dash_prefixed_override_folds_subdomain(self):
-        result = parsedmarc.utils.get_base_domain("host-1-2-3-4-nobre.com.br")
-        self.assertEqual(result, "nobre.com.br")
-
-    def test_unmatched_domain_falls_through_to_psl(self):
-        result = parsedmarc.utils.get_base_domain("sub.example.com")
-        self.assertEqual(result, "example.com")
-
-
-class TestExtractReport(unittest.TestCase):
-    """Tests for parsedmarc.extract_report()"""
-
-    def testExtractReportFromBytes(self):
-        """extract_report handles raw XML bytes"""
-        xml = b'<?xml version="1.0"?><feedback><report_metadata></report_metadata></feedback>'
-        result = parsedmarc.extract_report(xml)
-        self.assertIn("<feedback>", result)
-
-    def testExtractReportFromBase64Xml(self):
-        """extract_report handles base64-encoded XML string"""
-        import base64
-
-        xml = b'<?xml version="1.0"?><feedback></feedback>'
-        b64 = base64.b64encode(xml).decode()
-        result = parsedmarc.extract_report(b64)
-        self.assertIn("<feedback>", result)
-
-    def testExtractReportFromGzip(self):
-        """extract_report handles gzip compressed content"""
-        import gzip
-
-        xml = b'<?xml version="1.0"?><feedback></feedback>'
-        compressed = gzip.compress(xml)
-        result = parsedmarc.extract_report(compressed)
-        self.assertIn("<feedback>", result)
-
-    def testExtractReportFromZip(self):
-        """extract_report handles zip compressed content"""
-        import zipfile
-
-        xml = b'<?xml version="1.0"?><feedback></feedback>'
-        buf = BytesIO()
-        with zipfile.ZipFile(buf, "w") as zf:
-            zf.writestr("report.xml", xml)
-        result = parsedmarc.extract_report(buf.getvalue())
-        self.assertIn("<feedback>", result)
-
-    def testExtractReportFromBinaryIO(self):
-        """extract_report handles file-like BinaryIO objects"""
-        xml = b'<?xml version="1.0"?><feedback></feedback>'
-        bio = BytesIO(xml)
-        result = parsedmarc.extract_report(bio)
-        self.assertIn("<feedback>", result)
-
-    def testExtractReportFromNonSeekableStream(self):
-        """extract_report handles non-seekable streams"""
-        xml = b'<?xml version="1.0"?><feedback></feedback>'
-
-        class NonSeekable:
-            def __init__(self, data):
-                self._data = data
-                self._pos = 0
-
-            def read(self, n=-1):
-                if n == -1:
-                    result = self._data[self._pos :]
-                    self._pos = len(self._data)
-                else:
-                    result = self._data[self._pos : self._pos + n]
-                    self._pos += n
-                return result
-
-            def seekable(self):
-                return False
-
-            def close(self):
-                pass
-
-        result = parsedmarc.extract_report(cast(BinaryIO, NonSeekable(xml)))
-        self.assertIn("<feedback>", result)
-
-    def testExtractReportInvalidContent(self):
-        """extract_report raises ParserError for invalid content"""
-        with self.assertRaises(parsedmarc.ParserError):
-            parsedmarc.extract_report(b"this is not a valid archive")
-
-    def testExtractReportTextModeRaises(self):
-        """extract_report raises ParserError for text-mode streams"""
-
-        class TextStream:
-            def read(self, n=-1):
-                return "text data"
-
-            def seekable(self):
-                return True
-
-            def seek(self, pos):
-                pass
-
-            def close(self):
-                pass
-
-        with self.assertRaises(parsedmarc.ParserError):
-            parsedmarc.extract_report(cast(BinaryIO, TextStream()))
-
-
-class TestMalformedXmlRecovery(unittest.TestCase):
-    """Tests for XML recovery in parse_aggregate_report_xml"""
-
-    def testRecoversMalformedXml(self):
-        """Malformed XML triggers recovery path and still parses"""
-        # XML with a broken tag that xmltodict will reject but lxml can recover
-        malformed_xml = """<?xml version="1.0"?>
-<feedback>
-  <report_metadata>
-    <org_name>example.com</org_name>
-    <email>dmarc@example.com</email>
-    <report_id>12345</report_id>
-    <date_range><begin>1680000000</begin><end>1680086400</end></date_range>
-  </report_metadata>
-  <policy_published>
-    <domain>example.com</domain><p>none</p>
-  </policy_published>
-  <record>
-    <row><source_ip>203.0.113.1</source_ip><count>1</count>
-      <policy_evaluated><disposition>none</disposition><dkim>pass</dkim><spf>pass</spf></policy_evaluated>
-    </row>
-    <identifiers><header_from>example.com</header_from></identifiers>
-    <auth_results><spf><domain>example.com</domain><result>pass</result></spf></auth_results>
-  </record>
-  <broken_tag
-</feedback>"""
-        # lxml recovery may succeed or fail depending on how broken the XML is
-        # Either way, no unhandled exception should escape
-        try:
-            report = parsedmarc.parse_aggregate_report_xml(malformed_xml, offline=True)
-            self.assertIn("report_metadata", report)
-        except parsedmarc.InvalidAggregateReport:
-            pass  # Also acceptable
-
-    def testBytesXmlInput(self):
-        """XML bytes input is decoded"""
-        xml = b"""<?xml version="1.0"?>
-<feedback>
-  <report_metadata>
-    <org_name>example.com</org_name>
-    <email>dmarc@example.com</email>
-    <report_id>test-bytes-input</report_id>
-    <date_range><begin>1680000000</begin><end>1680086400</end></date_range>
-  </report_metadata>
-  <policy_published>
-    <domain>example.com</domain><p>none</p>
-  </policy_published>
-  <record>
-    <row><source_ip>203.0.113.1</source_ip><count>1</count>
-      <policy_evaluated><disposition>none</disposition><dkim>pass</dkim><spf>pass</spf></policy_evaluated>
-    </row>
-    <identifiers><header_from>example.com</header_from></identifiers>
-    <auth_results><spf><domain>example.com</domain><result>pass</result></spf></auth_results>
-  </record>
-</feedback>"""
-        report = parsedmarc.parse_aggregate_report_xml(xml.decode(), offline=True)
-        self.assertEqual(report["report_metadata"]["report_id"], "test-bytes-input")
-
-    def testExpatErrorRaises(self):
-        """Completely invalid XML raises InvalidAggregateReport"""
-        with self.assertRaises(parsedmarc.InvalidAggregateReport):
-            parsedmarc.parse_aggregate_report_xml("not xml at all {}", offline=True)
-
-    def testMissingOrgName(self):
-        """Missing org_name raises InvalidAggregateReport"""
-        xml = """<?xml version="1.0"?>
-<feedback>
-  <report_metadata>
-    <email>dmarc@example.com</email>
-    <report_id>missing-org</report_id>
-    <date_range><begin>1680000000</begin><end>1680086400</end></date_range>
-  </report_metadata>
-  <policy_published><domain>example.com</domain><p>none</p></policy_published>
-  <record>
-    <row><source_ip>1.2.3.4</source_ip><count>1</count>
-      <policy_evaluated><disposition>none</disposition><dkim>pass</dkim><spf>pass</spf></policy_evaluated>
-    </row>
-    <identifiers><header_from>example.com</header_from></identifiers>
-    <auth_results><spf><domain>example.com</domain><result>pass</result></spf></auth_results>
-  </record>
-</feedback>"""
-        with self.assertRaises(parsedmarc.InvalidAggregateReport):
-            parsedmarc.parse_aggregate_report_xml(xml, offline=True)
-
-
-class TestPolicyPublishedEdgeCases(unittest.TestCase):
-    """Tests for edge cases in policy_published parsing"""
-
-    VALID_XML_TEMPLATE = """<?xml version="1.0"?>
-<feedback>
-  <report_metadata>
-    <org_name>example.com</org_name>
-    <email>dmarc@example.com</email>
-    <report_id>test-{tag}</report_id>
-    <date_range><begin>1680000000</begin><end>1680086400</end></date_range>
-    {extra_metadata}
-  </report_metadata>
-  <policy_published>
-    <domain>example.com</domain><p>reject</p>
-    {policy_extra}
-  </policy_published>
-  <record>
-    <row><source_ip>203.0.113.1</source_ip><count>1</count>
-      <policy_evaluated><disposition>none</disposition><dkim>pass</dkim><spf>pass</spf></policy_evaluated>
-    </row>
-    <identifiers><header_from>example.com</header_from></identifiers>
-    <auth_results><spf><domain>example.com</domain><result>pass</result></spf></auth_results>
-  </record>
-</feedback>"""
-
-    def _parse(self, tag="default", policy_extra="", extra_metadata=""):
-        xml = self.VALID_XML_TEMPLATE.format(
-            tag=tag, policy_extra=policy_extra, extra_metadata=extra_metadata
-        )
-        return parsedmarc.parse_aggregate_report_xml(xml, offline=True)
-
-    def testPolicyPublishedListHandled(self):
-        """policy_published as a list uses first element"""
-        # The code checks `if type(policy_published) is list`
-        # This is tested implicitly when xmltodict returns a list;
-        # we test via the np field presence
-        report = self._parse(tag="np", policy_extra="<np>quarantine</np>")
-        self.assertEqual(report["policy_published"]["np"], "quarantine")
-
-    def testNpFieldValues(self):
-        """np field is parsed correctly"""
-        for val in ["none", "quarantine", "reject"]:
-            report = self._parse(tag=f"np-{val}", policy_extra=f"<np>{val}</np>")
-            self.assertEqual(report["policy_published"]["np"], val)
-
-    def testTestingField(self):
-        """testing field is parsed correctly"""
-        for val in ["y", "n"]:
-            report = self._parse(
-                tag=f"testing-{val}", policy_extra=f"<testing>{val}</testing>"
-            )
-            self.assertEqual(report["policy_published"]["testing"], val)
-
-    def testDiscoveryMethodField(self):
-        """discovery_method field is parsed correctly"""
-        for val in ["psl", "treewalk"]:
-            report = self._parse(
-                tag=f"disc-{val}",
-                policy_extra=f"<discovery_method>{val}</discovery_method>",
-            )
-            self.assertEqual(report["policy_published"]["discovery_method"], val)
-
-    def testGeneratorField(self):
-        """generator field in report_metadata is parsed"""
-        report = self._parse(
-            tag="gen", extra_metadata="<generator>TestGen/1.0</generator>"
-        )
-        self.assertEqual(report["report_metadata"]["generator"], "TestGen/1.0")
-
-    def testPctFieldNone(self):
-        """pct defaults to None when absent (removed in RFC 9989)"""
-        report = self._parse(tag="no-pct")
-        self.assertIsNone(report["policy_published"]["pct"])
-
-    def testFoFieldNone(self):
-        """fo defaults to None when absent (RFC 9990 keeps it optional)"""
-        report = self._parse(tag="no-fo")
-        self.assertIsNone(report["policy_published"]["fo"])
-
-    def testReportMetadataErrors(self):
-        """Report metadata errors are captured"""
-        report = self._parse(
-            tag="errors",
-            extra_metadata="<error>DNS timeout</error>",
-        )
-        self.assertIn("DNS timeout", report["report_metadata"]["errors"])
-
-    def testReportMetadataErrorsList(self):
-        """Report metadata errors as list are captured"""
-        report = self._parse(
-            tag="errors-list",
-            extra_metadata="<error>error1</error><error>error2</error>",
-        )
-        self.assertIn("error1", report["report_metadata"]["errors"])
-        self.assertIn("error2", report["report_metadata"]["errors"])
-
-    def testRecordParseFailureSkipped(self):
-        """Bad records are skipped with a warning, not crashing"""
-        xml = """<?xml version="1.0"?>
-<feedback>
-  <report_metadata>
-    <org_name>example.com</org_name>
-    <email>dmarc@example.com</email>
-    <report_id>bad-records</report_id>
-    <date_range><begin>1680000000</begin><end>1680086400</end></date_range>
-  </report_metadata>
-  <policy_published><domain>example.com</domain><p>none</p></policy_published>
-  <record>
-    <row><source_ip>203.0.113.1</source_ip><count>1</count>
-      <policy_evaluated><disposition>none</disposition><dkim>pass</dkim><spf>pass</spf></policy_evaluated>
-    </row>
-    <identifiers><header_from>example.com</header_from></identifiers>
-    <auth_results><spf><domain>example.com</domain><result>pass</result></spf></auth_results>
-  </record>
-  <record>
-    <row><source_ip>bad-ip</source_ip><count>not-a-number</count>
-      <policy_evaluated><disposition>none</disposition><dkim>pass</dkim><spf>pass</spf></policy_evaluated>
-    </row>
-    <identifiers><header_from>example.com</header_from></identifiers>
-    <auth_results><spf><domain>example.com</domain><result>pass</result></spf></auth_results>
-  </record>
-</feedback>"""
-        report = parsedmarc.parse_aggregate_report_xml(xml, offline=True)
-        # At least the valid record should be parsed
-        self.assertTrue(len(report["records"]) >= 1)
-
-
-class TestParseReportFile(unittest.TestCase):
-    """Tests for parse_report_file with various input types"""
-
-    def testParseReportFileFromBytes(self):
-        """parse_report_file works with bytes input"""
-        xml_path = "samples/aggregate/!example.com!1538204542!1538463818.xml"
-        with open(xml_path, "rb") as f:
-            content = f.read()
-        result = parsedmarc.parse_report_file(content, offline=True)
-        self.assertEqual(result["report_type"], "aggregate")
-
-    def testParseReportFileFromBinaryIO(self):
-        """parse_report_file works with BinaryIO input"""
-        xml_path = "samples/aggregate/!example.com!1538204542!1538463818.xml"
-        with open(xml_path, "rb") as f:
-            result = parsedmarc.parse_report_file(f, offline=True)
-        self.assertEqual(result["report_type"], "aggregate")
-
-    def testParseReportFileFromPathlib(self):
-        """parse_report_file works with pathlib.Path input"""
-        xml_path = Path("samples/aggregate/!example.com!1538204542!1538463818.xml")
-        result = parsedmarc.parse_report_file(xml_path, offline=True)
-        self.assertEqual(result["report_type"], "aggregate")
-
-    def testParseReportFileSmtpTls(self):
-        """parse_report_file detects SMTP TLS reports"""
-        result = parsedmarc.parse_report_file(
-            "samples/smtp_tls/smtp_tls.json", offline=True
-        )
-        self.assertEqual(result["report_type"], "smtp_tls")
-
-    def testParseReportFileEmail(self):
-        """parse_report_file detects failure reports in email format"""
-        eml_path = "samples/failure/dmarc_ruf_report_linkedin.eml"
-        result = parsedmarc.parse_report_file(eml_path, offline=True)
-        self.assertEqual(result["report_type"], "failure")
-
-    def testParseReportFileInvalid(self):
-        """parse_report_file raises ParserError for invalid content"""
-        with self.assertRaises(parsedmarc.ParserError):
-            parsedmarc.parse_report_file(b"this is not a report", offline=True)
-
-
-class TestParseReportEmail(unittest.TestCase):
-    """Tests for parse_report_email edge cases"""
-
-    def testSmtpTlsEmailReport(self):
-        """parse_report_email handles SMTP TLS reports in email format"""
-        eml_path = "samples/smtp_tls/google.com_smtp_tls_report.eml"
-        with open(eml_path, "rb") as f:
-            content = f.read()
-        result = parsedmarc.parse_report_email(content, offline=True)
-        self.assertEqual(result["report_type"], "smtp_tls")
-
-    def testInvalidEmailRaisesError(self):
-        """parse_report_email raises error for non-DMARC email"""
-        email_str = """From: test@example.com
-Subject: Hello World
-Content-Type: text/plain
-
-This is not a DMARC report."""
-        with self.assertRaises(parsedmarc.InvalidDMARCReport):
-            parsedmarc.parse_report_email(email_str, offline=True)
-
-
-class TestFailureReportParsing(unittest.TestCase):
-    """Tests for failure report field defaults and edge cases"""
-
-    def _make_feedback_report(self, **overrides):
-        """Create a minimal feedback report string"""
-        fields = {
-            "Feedback-Type": "auth-failure",
-            "User-Agent": "test/1.0",
-            "Version": "1",
-            "Original-Mail-From": "sender@example.com",
-            "Arrival-Date": "Thu, 1 Jan 2024 00:00:00 +0000",
-            "Source-IP": "203.0.113.1",
-            "Reported-Domain": "example.com",
-            "Auth-Failure": "dmarc",
-        }
-        fields.update(overrides)
-        return "\n".join(f"{k}: {v}" for k, v in fields.items())
-
-    def _make_sample(self):
-        return """From: sender@example.com
-To: recipient@example.com
-Subject: Test
-Date: Thu, 1 Jan 2024 00:00:00 +0000
-
-Test body"""
-
-    def _default_msg_date(self):
-        return datetime(2024, 1, 1, 0, 0, 0, tzinfo=timezone.utc)
-
-    def testMissingVersion(self):
-        """Missing version defaults to None"""
-        report_str = self._make_feedback_report()
-        lines = [ln for ln in report_str.split("\n") if not ln.startswith("Version:")]
-        report_str = "\n".join(lines)
-        report = parsedmarc.parse_failure_report(
-            report_str, self._make_sample(), self._default_msg_date(), offline=True
-        )
-        self.assertIsNone(report["version"])
-
-    def testMissingUserAgent(self):
-        """Missing user_agent defaults to None"""
-        report_str = self._make_feedback_report()
-        lines = [
-            ln for ln in report_str.split("\n") if not ln.startswith("User-Agent:")
-        ]
-        report_str = "\n".join(lines)
-        report = parsedmarc.parse_failure_report(
-            report_str, self._make_sample(), self._default_msg_date(), offline=True
-        )
-        self.assertIsNone(report["user_agent"])
-
-    def testMissingDeliveryResult(self):
-        """Missing delivery_result maps to 'other' when field absent"""
-        report_str = self._make_feedback_report()
-        report = parsedmarc.parse_failure_report(
-            report_str, self._make_sample(), self._default_msg_date(), offline=True
-        )
-        # When delivery_result is not in the parsed report, it's set to None,
-        # but then the validation check maps None (not in delivery_results list) to "other"
-        self.assertEqual(report["delivery_result"], "other")
-
-    def testDeliveryResultMapped(self):
-        """Known delivery_result values are mapped correctly"""
-        for val in ["delivered", "spam", "policy", "reject"]:
-            report_str = self._make_feedback_report(**{"Delivery-Result": val})
-            report = parsedmarc.parse_failure_report(
-                report_str, self._make_sample(), self._default_msg_date(), offline=True
-            )
-            self.assertEqual(report["delivery_result"], val)
-
-    def testDeliveryResultUnknownMapsToOther(self):
-        """Unknown delivery_result maps to 'other'"""
-        report_str = self._make_feedback_report(**{"Delivery-Result": "unknown-value"})
-        report = parsedmarc.parse_failure_report(
-            report_str, self._make_sample(), self._default_msg_date(), offline=True
-        )
-        self.assertEqual(report["delivery_result"], "other")
-
-    def testIdentityAlignmentNone(self):
-        """identity_alignment='none' results in empty auth mechanisms"""
-        report_str = self._make_feedback_report(**{"Identity-Alignment": "none"})
-        report = parsedmarc.parse_failure_report(
-            report_str, self._make_sample(), self._default_msg_date(), offline=True
-        )
-        self.assertEqual(report["authentication_mechanisms"], [])
-
-    def testIdentityAlignmentMultiple(self):
-        """identity_alignment with multiple values is split"""
-        report_str = self._make_feedback_report(**{"Identity-Alignment": "dkim,spf"})
-        report = parsedmarc.parse_failure_report(
-            report_str, self._make_sample(), self._default_msg_date(), offline=True
-        )
-        self.assertEqual(report["authentication_mechanisms"], ["dkim", "spf"])
-
-    def testIdentityAlignmentCFWSWhitespaceStripped(self):
-        """RFC 9991 ABNF allows CFWS around the commas in
-        Identity-Alignment. The previous parser left leading whitespace
-        on the second token ('dkim, spf' -> ['dkim', ' spf']); CFWS-aware
-        splitting yields ['dkim', 'spf']."""
-        report_str = self._make_feedback_report(**{"Identity-Alignment": "dkim, spf"})
-        report = parsedmarc.parse_failure_report(
-            report_str, self._make_sample(), self._default_msg_date(), offline=True
-        )
-        self.assertEqual(report["authentication_mechanisms"], ["dkim", "spf"])
-
-    def testAuthFailureCFWSWhitespaceStripped(self):
-        """Auth-Failure (also comma-separated per RFC 9991) is whitespace-
-        stripped per token."""
-        report_str = self._make_feedback_report(**{"Auth-Failure": "dmarc, spf"})
-        report = parsedmarc.parse_failure_report(
-            report_str, self._make_sample(), self._default_msg_date(), offline=True
-        )
-        self.assertEqual(report["auth_failure"], ["dmarc", "spf"])
-
-    def testMissingIdentityAlignmentWarns(self):
-        """Identity-Alignment is REQUIRED per RFC 9991; the parser
-        defaults silently for permissiveness but logs a warning so the
-        broken reporter is visible."""
-        report_str = self._make_feedback_report()
-        lines = [
-            ln
-            for ln in report_str.split("\n")
-            if not ln.startswith("Identity-Alignment:")
-        ]
-        report_str = "\n".join(lines)
-        with self.assertLogs("parsedmarc.log", level="WARNING") as cm:
-            report = parsedmarc.parse_failure_report(
-                report_str,
-                self._make_sample(),
-                self._default_msg_date(),
-                offline=True,
-            )
-        self.assertEqual(report["authentication_mechanisms"], [])
-        self.assertTrue(
-            any("Identity-Alignment" in m and "RFC 9991" in m for m in cm.output),
-            f"Expected Identity-Alignment RFC 9991 warning; got: {cm.output}",
-        )
-
-    def testMissingAuthFailureWarns(self):
-        """Auth-Failure is REQUIRED per RFC 9991; the parser defaults
-        to 'dmarc' but logs a warning."""
-        report_str = self._make_feedback_report()
-        lines = [
-            ln for ln in report_str.split("\n") if not ln.startswith("Auth-Failure:")
-        ]
-        report_str = "\n".join(lines)
-        with self.assertLogs("parsedmarc.log", level="WARNING") as cm:
-            report = parsedmarc.parse_failure_report(
-                report_str,
-                self._make_sample(),
-                self._default_msg_date(),
-                offline=True,
-            )
-        self.assertEqual(report["auth_failure"], ["dmarc"])
-        self.assertTrue(
-            any("Auth-Failure" in m and "RFC 9991" in m for m in cm.output),
-            f"Expected Auth-Failure RFC 9991 warning; got: {cm.output}",
-        )
-
-    def testMissingReportedDomainFallback(self):
-        """Missing reported_domain falls back to sample from domain"""
-        report_str = self._make_feedback_report()
-        lines = [
-            ln for ln in report_str.split("\n") if not ln.startswith("Reported-Domain:")
-        ]
-        report_str = "\n".join(lines)
-        report = parsedmarc.parse_failure_report(
-            report_str, self._make_sample(), self._default_msg_date(), offline=True
-        )
-        self.assertEqual(report["reported_domain"], "example.com")
-
-    def testMissingArrivalDateWithMsgDate(self):
-        """Missing arrival_date uses msg_date fallback"""
-        report_str = self._make_feedback_report()
-        lines = [
-            ln for ln in report_str.split("\n") if not ln.startswith("Arrival-Date:")
-        ]
-        report_str = "\n".join(lines)
-        msg_date = datetime(2024, 6, 15, 12, 0, 0, tzinfo=timezone.utc)
-        report = parsedmarc.parse_failure_report(
-            report_str, self._make_sample(), msg_date, offline=True
-        )
-        self.assertIn("2024-06-15", report["arrival_date"])
-
-    def testMissingArrivalDateNoMsgDateRaises(self):
-        """Missing arrival_date with no msg_date raises"""
-        report_str = self._make_feedback_report()
-        lines = [
-            ln for ln in report_str.split("\n") if not ln.startswith("Arrival-Date:")
-        ]
-        report_str = "\n".join(lines)
-        with self.assertRaises(parsedmarc.InvalidFailureReport):
-            parsedmarc.parse_failure_report(
-                report_str,
-                self._make_sample(),
-                cast(datetime, None),  # intentionally None to test error path
-                offline=True,
-            )
-
-
-class TestWebhookClient(unittest.TestCase):
-    """Tests for webhook client initialization and close"""
-
-    def testClose(self):
-        """WebhookClient.close() closes session"""
-        client = parsedmarc.webhook.WebhookClient(
-            aggregate_url="http://invalid.test/agg",
-            failure_url="http://invalid.test/fail",
-            smtp_tls_url="http://invalid.test/tls",
-        )
-        mock_close = MagicMock()
-        client.session.close = mock_close
-        client.close()
-        mock_close.assert_called_once()
-
-
-class TestUtilsDnsCaching(unittest.TestCase):
-    """Tests for DNS query caching and reverse DNS error handling"""
-
-    def testQueryDnsUsesCacheHit(self):
-        """query_dns returns cached result without making DNS query"""
-        cache = ExpiringDict(max_len=100, max_age_seconds=60)
-        cache["example.com_A"] = ["1.2.3.4"]
-        result = parsedmarc.utils.query_dns("example.com", "A", cache=cache)
-        self.assertEqual(result, ["1.2.3.4"])
-
-    def testQueryDnsCachesResult(self):
-        """query_dns stores result in cache when cache is non-empty"""
-        cache = ExpiringDict(max_len=100, max_age_seconds=60)
-        # Pre-populate so ExpiringDict is truthy
-        cache["seed_key"] = ["seed"]
-        mock_record = MagicMock()
-        mock_record.to_text.return_value = '"1.2.3.4"'
-        mock_resolver = MagicMock()
-        mock_resolver.resolve.return_value = [mock_record]
-        with patch(
-            "parsedmarc.utils.dns.resolver.Resolver", return_value=mock_resolver
-        ):
-            result = parsedmarc.utils.query_dns(
-                "test-cache.example.com", "A", cache=cache
-            )
-            self.assertEqual(result, ["1.2.3.4"])
-            self.assertIn("test-cache.example.com_A", cache)
-
-    def testReverseDnsReturnsNoneOnFailure(self):
-        """get_reverse_dns returns None on DNS exceptions"""
-        with patch(
-            "parsedmarc.utils.query_dns",
-            side_effect=dns.exception.DNSException("timeout"),
-        ):
-            result = parsedmarc.utils.get_reverse_dns("203.0.113.1")
-            self.assertIsNone(result)
-
-
-class TestUtilsIpDbPaths(unittest.TestCase):
-    """Tests for IP database path validation"""
-
-    def testCustomPathFallsBack(self):
-        """Non-existent custom db path falls back to default"""
-        result = parsedmarc.utils.get_ip_address_country(
-            "1.1.1.1", db_path="/nonexistent/path.mmdb"
-        )
-        self.assertTrue(result is None or isinstance(result, str))
-
-    def testBundledDbWorks(self):
-        """Bundled IP database returns results"""
-        result = parsedmarc.utils.get_ip_address_country("8.8.8.8")
-        self.assertEqual(result, "US")
-
-
-class TestUtilsParseEmail(unittest.TestCase):
-    """Tests for parse_email edge cases"""
-
-    def testMinimalEmail(self):
-        """parse_email handles email with minimal headers"""
-        email_str = """From: test@example.com
-Subject: Test
-
-Body text"""
-        result = parsedmarc.utils.parse_email(email_str)
-        self.assertEqual(result["subject"], "Test")
-        self.assertEqual(result["reply_to"], [])
-
-    def testEmailWithNoSubject(self):
-        """parse_email defaults subject to None when missing"""
-        email_str = """From: test@example.com
-To: other@example.com
-
-Body"""
-        result = parsedmarc.utils.parse_email(email_str)
-        self.assertIsNone(result["subject"])
-
-    def testEmailBytesInput(self):
-        """parse_email handles bytes input"""
-        email_bytes = b"""From: test@example.com
-Subject: Bytes Test
-To: other@example.com
-
-Body"""
-        result = parsedmarc.utils.parse_email(email_bytes)
-        self.assertEqual(result["subject"], "Bytes Test")
-
-    def testEmailWithAttachments(self):
-        """parse_email with strip_attachment_payloads removes payloads"""
-        from email.mime.multipart import MIMEMultipart
-        from email.mime.text import MIMEText
-        from email.mime.base import MIMEBase
-        from email import encoders
-
-        msg = MIMEMultipart()
-        msg["From"] = "test@example.com"
-        msg["To"] = "other@example.com"
-        msg["Subject"] = "Attachment Test"
-        msg.attach(MIMEText("Body text"))
-
-        attachment = MIMEBase("application", "octet-stream")
-        attachment.set_payload(b"file content here")
-        encoders.encode_base64(attachment)
-        attachment.add_header("Content-Disposition", "attachment", filename="test.bin")
-        msg.attach(attachment)
-
-        result = parsedmarc.utils.parse_email(
-            msg.as_string(), strip_attachment_payloads=True
-        )
-        for att in result["attachments"]:
-            self.assertNotIn("payload", att)
-
-
-class TestUtilsOutlookMsg(unittest.TestCase):
-    """Tests for Outlook MSG detection and conversion"""
-
-    def testIsOutlookMsg(self):
-        """is_outlook_msg detects MSG magic bytes"""
-        msg_magic = b"\xd0\xcf\x11\xe0\xa1\xb1\x1a\xe1" + b"\x00" * 100
-        self.assertTrue(parsedmarc.utils.is_outlook_msg(msg_magic))
-
-    def testIsNotOutlookMsg(self):
-        """is_outlook_msg rejects non-MSG content"""
-        self.assertFalse(parsedmarc.utils.is_outlook_msg(b"not an msg file"))
-        self.assertFalse(parsedmarc.utils.is_outlook_msg("string input"))
-
-    def testConvertOutlookMsgInvalidInput(self):
-        """convert_outlook_msg raises ValueError for non-MSG bytes"""
-        with self.assertRaises(ValueError):
-            parsedmarc.utils.convert_outlook_msg(b"not an msg file")
-
-
-class TestUtilsReverseDnsMap(unittest.TestCase):
-    """Tests for reverse DNS map loading"""
-
-    def testLoadReverseDnsMapOffline(self):
-        """load_reverse_dns_map in offline mode loads bundled map"""
-        rdns_map = {}
-        parsedmarc.utils.load_reverse_dns_map(rdns_map, offline=True)
-        self.assertTrue(len(rdns_map) > 0)
-
-    def testLoadReverseDnsMapLocalOverride(self):
-        """load_reverse_dns_map uses local_file_path when provided"""
-        with NamedTemporaryFile("w", suffix=".csv", delete=False) as f:
-            f.write("base_reverse_dns,name,type\n")
-            f.write("custom.example.com,Custom Service,hosting\n")
-            path = f.name
-        try:
-            rdns_map = {}
-            parsedmarc.utils.load_reverse_dns_map(
-                rdns_map, offline=True, local_file_path=path
-            )
-            self.assertIn("custom.example.com", rdns_map)
-            self.assertEqual(rdns_map["custom.example.com"]["name"], "Custom Service")
-        finally:
-            os.remove(path)
-
-    def testLoadReverseDnsMapNetworkFailureFallback(self):
-        """load_reverse_dns_map falls back to bundled on network error"""
-        rdns_map = {}
-        with patch(
-            "parsedmarc.utils.requests.get",
-            side_effect=requests.exceptions.ConnectionError("no network"),
-        ):
-            parsedmarc.utils.load_reverse_dns_map(rdns_map)
-        self.assertTrue(len(rdns_map) > 0)
-
-
-class TestSmtpTlsReportErrors(unittest.TestCase):
-    """Tests for SMTP TLS report error handling"""
-
-    def testMissingRequiredField(self):
-        """Missing required field raises InvalidSMTPTLSReport"""
-        json_str = json.dumps({"policies": []})
-        with self.assertRaises(parsedmarc.InvalidSMTPTLSReport):
-            parsedmarc.parse_smtp_tls_report_json(json_str)
-
-    def testInvalidJson(self):
-        """Invalid JSON raises InvalidSMTPTLSReport"""
-        with self.assertRaises(parsedmarc.InvalidSMTPTLSReport):
-            parsedmarc.parse_smtp_tls_report_json("not json {{{")
-
-
-class TestBucketIntervalEdgeCases(unittest.TestCase):
-    """Tests for _bucket_interval_by_day edge cases"""
-
-    def testDayCursorAdjustment(self):
-        """When begin is before midnight due to tz, day_cursor adjusts back"""
-        # Use a timezone where midnight calculation might cause day_cursor > begin
-        import pytz
-
-        tz = pytz.FixedOffset(-600)  # UTC-10
-        begin = datetime(2024, 1, 1, 23, 30, 0, tzinfo=timezone.utc).astimezone(tz)
-        end = datetime(2024, 1, 3, 0, 0, 0, tzinfo=timezone.utc).astimezone(tz)
-        buckets = parsedmarc._bucket_interval_by_day(begin, end, 100)
-        total = sum(b["count"] for b in buckets)
-        self.assertEqual(total, 100)
-
-
-class TestGetDmarcReportsFromMbox(unittest.TestCase):
-    """Tests for mbox parsing"""
-
-    def testEmptyMbox(self):
-        """Empty mbox returns empty results"""
-        with NamedTemporaryFile(suffix=".mbox", delete=False) as f:
-            f.write(b"")
-            path = f.name
-        try:
-            results = parsedmarc.get_dmarc_reports_from_mbox(path, offline=True)
-            self.assertEqual(results["aggregate_reports"], [])
-            self.assertEqual(results["failure_reports"], [])
-            self.assertEqual(results["smtp_tls_reports"], [])
-        finally:
-            os.remove(path)
-
-    def testMboxWithAggregateReport(self):
-        """Mbox with aggregate report email is parsed"""
-        from email.mime.multipart import MIMEMultipart
-        from email.mime.application import MIMEApplication
-        import gzip
-
-        xml = b"""<?xml version="1.0"?>
-<feedback>
-  <report_metadata>
-    <org_name>example.com</org_name>
-    <email>dmarc@example.com</email>
-    <report_id>mbox-test-123</report_id>
-    <date_range><begin>1680000000</begin><end>1680086400</end></date_range>
-  </report_metadata>
-  <policy_published><domain>example.com</domain><p>none</p></policy_published>
-  <record>
-    <row><source_ip>203.0.113.1</source_ip><count>1</count>
-      <policy_evaluated><disposition>none</disposition><dkim>pass</dkim><spf>pass</spf></policy_evaluated>
-    </row>
-    <identifiers><header_from>example.com</header_from></identifiers>
-    <auth_results><spf><domain>example.com</domain><result>pass</result></spf></auth_results>
-  </record>
-</feedback>"""
-        compressed = gzip.compress(xml)
-
-        msg = MIMEMultipart()
-        msg["From"] = "dmarc@example.com"
-        msg["To"] = "postmaster@example.com"
-        msg["Subject"] = "DMARC Aggregate Report"
-        msg["Date"] = "Thu, 1 Jan 2024 00:00:00 +0000"
-        att = MIMEApplication(compressed, "gzip")
-        att.add_header("Content-Disposition", "attachment", filename="report.xml.gz")
-        msg.attach(att)
-
-        with NamedTemporaryFile(suffix=".mbox", delete=False, mode="w") as f:
-            # mbox format requires "From " line
-            f.write("From dmarc@example.com Thu Jan  1 00:00:00 2024\n")
-            f.write(msg.as_string())
-            f.write("\n")
-            path = f.name
-        try:
-            results = parsedmarc.get_dmarc_reports_from_mbox(path, offline=True)
-            self.assertTrue(len(results["aggregate_reports"]) >= 1)
-        finally:
-            os.remove(path)
-
-
-class TestPslOverrides(unittest.TestCase):
-    """Tests for PSL override matching"""
-
-    def testOverrideMatch(self):
-        """PSL overrides are applied when domain ends with override"""
-        # psl_overrides contains entries; test that get_base_domain
-        # handles them without error
-        result = parsedmarc.utils.get_base_domain("sub.example.com")
-        self.assertEqual(result, "example.com")
-
-
-class TestMapScriptsIPDetection(unittest.TestCase):
-    """Full-IP detection and PSL folding in the map-maintenance scripts."""
-
-    def test_collect_domain_info_detects_full_ips(self):
-        import parsedmarc.resources.maps.collect_domain_info as cdi
-
-        # Dotted and dashed four-octet patterns with valid octets: detected.
-        self.assertTrue(cdi._has_full_ip("74-208-244-234.cprapid.com"))
-        self.assertTrue(cdi._has_full_ip("host.192.168.1.1.example.com"))
-        self.assertTrue(cdi._has_full_ip("a-10-20-30-40-brand.com"))
-        # Three octets is NOT a full IP — OVH's reverse-DNS pattern stays safe.
-        self.assertFalse(cdi._has_full_ip("ip-147-135-108.us"))
-        # Out-of-range octet fails the 0-255 sanity check.
-        self.assertFalse(cdi._has_full_ip("999-1-2-3-foo.com"))
-        # Pure domain, no IP.
-        self.assertFalse(cdi._has_full_ip("example.com"))
-
-    def test_find_unknown_detects_full_ips(self):
-        import parsedmarc.resources.maps.find_unknown_base_reverse_dns as fu
-
-        self.assertTrue(fu._has_full_ip("170-254-144-204-nobreinternet.com.br"))
-        self.assertFalse(fu._has_full_ip("ip-147-135-108.us"))
-        self.assertFalse(fu._has_full_ip("cprapid.com"))
-
-    def test_apply_psl_override_dot_prefix(self):
-        import parsedmarc.resources.maps.collect_domain_info as cdi
-
-        ov = [".cprapid.com", ".linode.com"]
-        self.assertEqual(cdi._apply_psl_override("foo.cprapid.com", ov), "cprapid.com")
-        self.assertEqual(cdi._apply_psl_override("a.b.linode.com", ov), "linode.com")
-
-    def test_apply_psl_override_dash_prefix(self):
-        import parsedmarc.resources.maps.collect_domain_info as cdi
-
-        ov = ["-nobre.com.br"]
-        self.assertEqual(
-            cdi._apply_psl_override("1-2-3-4-nobre.com.br", ov), "nobre.com.br"
-        )
-
-    def test_apply_psl_override_no_match(self):
-        import parsedmarc.resources.maps.collect_domain_info as cdi
-
-        ov = [".cprapid.com"]
-        self.assertEqual(cdi._apply_psl_override("example.com", ov), "example.com")
-
-
-class TestDetectPSLOverrides(unittest.TestCase):
-    """Cluster detection, brand-tail extraction, and full-pipeline behaviour
-    for `detect_psl_overrides.py`."""
-
-    def setUp(self):
-        import parsedmarc.resources.maps.detect_psl_overrides as dpo
-
-        self.dpo = dpo
-
-    def test_extract_brand_tail_dot_separator(self):
-        self.assertEqual(
-            self.dpo.extract_brand_tail("74-208-244-234.cprapid.com"),
-            ".cprapid.com",
-        )
-
-    def test_extract_brand_tail_dash_separator(self):
-        self.assertEqual(
-            self.dpo.extract_brand_tail("170-254-144-204-nobre.com.br"),
-            "-nobre.com.br",
-        )
-
-    def test_extract_brand_tail_no_separator(self):
-        self.assertEqual(
-            self.dpo.extract_brand_tail("host134-254-143-190tigobusiness.com.ni"),
-            "tigobusiness.com.ni",
-        )
-
-    def test_extract_brand_tail_no_ip_returns_none(self):
-        self.assertIsNone(self.dpo.extract_brand_tail("plain.example.com"))
-
-    def test_extract_brand_tail_rejects_short_tail(self):
-        """A tail shorter than MIN_TAIL_LEN is rejected to avoid folding to `.com`."""
-        # Four-octet IP followed by only `.br` (2 chars after the dot) — too short.
-        self.assertIsNone(self.dpo.extract_brand_tail("1-2-3-4.br"))
-
-    def test_detect_clusters_meets_threshold(self):
-        domains = [
-            "1-2-3-4.cprapid.com",
-            "5-6-7-8.cprapid.com",
-            "9-10-11-12.cprapid.com",
-            "1-2-3-4-other.com.br",  # not enough of these
-        ]
-        clusters = self.dpo.detect_clusters(domains, threshold=3, known_overrides=set())
-        self.assertIn(".cprapid.com", clusters)
-        self.assertEqual(len(clusters[".cprapid.com"]), 3)
-        self.assertNotIn("-other.com.br", clusters)
-
-    def test_detect_clusters_honours_threshold(self):
-        domains = [
-            "1-2-3-4.cprapid.com",
-            "5-6-7-8.cprapid.com",
-        ]
-        clusters = self.dpo.detect_clusters(domains, threshold=3, known_overrides=set())
-        self.assertEqual(clusters, {})
-
-    def test_detect_clusters_skips_known_overrides(self):
-        """Tails already in psl_overrides.txt must not be re-proposed."""
-        domains = [
-            "1-2-3-4.cprapid.com",
-            "5-6-7-8.cprapid.com",
-            "9-10-11-12.cprapid.com",
-        ]
-        clusters = self.dpo.detect_clusters(
-            domains, threshold=3, known_overrides={".cprapid.com"}
-        )
-        self.assertNotIn(".cprapid.com", clusters)
-
-    def test_apply_override_matches_first(self):
-        """apply_override iterates in list order and returns on the first match."""
-        ov = [".cprapid.com", "-nobre.com.br"]
-        self.assertEqual(
-            self.dpo.apply_override("1-2-3-4.cprapid.com", ov), "cprapid.com"
-        )
-        self.assertEqual(
-            self.dpo.apply_override("1-2-3-4-nobre.com.br", ov), "nobre.com.br"
-        )
-        self.assertEqual(self.dpo.apply_override("unrelated.com", ov), "unrelated.com")
-
-    def test_has_full_ip_shared_with_other_scripts(self):
-        """The detect script's IP check must agree with the other map scripts."""
-        self.assertTrue(self.dpo.has_full_ip("74-208-244-234.cprapid.com"))
-        self.assertFalse(self.dpo.has_full_ip("ip-147-135-108.us"))
-        self.assertFalse(self.dpo.has_full_ip("example.com"))
-
-
-class TestIsMbox(unittest.TestCase):
-    """Tests for is_mbox utility"""
-
-    def testValidMbox(self):
-        """is_mbox returns True for valid mbox file"""
-        with NamedTemporaryFile(suffix=".mbox", delete=False, mode="w") as f:
-            f.write("From test@example.com Thu Jan  1 00:00:00 2024\n")
-            f.write("Subject: Test\n\nBody\n\n")
-            path = f.name
-        try:
-            self.assertTrue(parsedmarc.utils.is_mbox(path))
-        finally:
-            os.remove(path)
-
-    def testEmptyFileNotMbox(self):
-        """is_mbox returns False for empty file"""
-        with NamedTemporaryFile(suffix=".mbox", delete=False) as f:
-            path = f.name
-        try:
-            self.assertFalse(parsedmarc.utils.is_mbox(path))
-        finally:
-            os.remove(path)
-
-    def testNonExistentNotMbox(self):
-        """is_mbox returns False for non-existent file"""
-        self.assertFalse(parsedmarc.utils.is_mbox("/nonexistent/file.mbox"))
-
-
-if __name__ == "__main__":
-    unittest.main(verbosity=2)
diff --git a/tests/__init__.py b/tests/__init__.py
new file mode 100644
index 0000000..e69de29
diff --git a/tests/test_cli.py b/tests/test_cli.py
new file mode 100644
index 0000000..51d1c34
--- /dev/null
+++ b/tests/test_cli.py
@@ -0,0 +1,1809 @@
+"""Tests for parsedmarc.cli — CLI entry point, config parsing,
+env-var overrides, mailbox watch wiring, and SIGHUP reload."""
+
+import io
+import json
+import os
+import signal
+import sys
+import tempfile
+import unittest
+from configparser import ConfigParser
+from tempfile import NamedTemporaryFile
+from types import SimpleNamespace
+from typing import cast
+from unittest.mock import MagicMock, patch
+
+import parsedmarc
+import parsedmarc.cli
+import parsedmarc.opensearch as opensearch_module
+
+
+class _BreakLoop(BaseException):
+    pass
+
+
+class _DummyMailboxConnection(parsedmarc.MailboxConnection):
+    def __init__(self):
+        self.fetch_calls: list[dict[str, object]] = []
+
+    def create_folder(self, folder_name: str):
+        return None
+
+    def fetch_messages(self, reports_folder: str, **kwargs):
+        self.fetch_calls.append({"reports_folder": reports_folder, **kwargs})
+        return []
+
+    def fetch_message(self, message_id) -> str:
+        return ""
+
+    def delete_message(self, message_id):
+        return None
+
+    def move_message(self, message_id, folder_name: str):
+        return None
+
+    def keepalive(self):
+        return None
+
+    def watch(self, check_callback, check_timeout, config_reloading=None):
+        return None
+
+
+class Test(unittest.TestCase):
+    """Kitchen-sink tests redistributed from the original
+    tests.py monolith. Future PRs should split these further
+    into purpose-specific TestCase subclasses as natural
+    groupings emerge."""
+
+    def testOpenSearchSigV4RequiresRegion(self):
+        with self.assertRaises(opensearch_module.OpenSearchError):
+            opensearch_module.set_hosts(
+                "https://example.org:9200",
+                auth_type="awssigv4",
+            )
+
+    def testOpenSearchSigV4ConfiguresConnectionClass(self):
+        fake_credentials = object()
+        with patch.object(opensearch_module.boto3, "Session") as session_cls:
+            session_cls.return_value.get_credentials.return_value = fake_credentials
+            with patch.object(
+                opensearch_module, "AWSV4SignerAuth", return_value="auth"
+            ) as signer:
+                with patch.object(
+                    opensearch_module.connections, "create_connection"
+                ) as create_connection:
+                    opensearch_module.set_hosts(
+                        "https://example.org:9200",
+                        use_ssl=True,
+                        auth_type="awssigv4",
+                        aws_region="eu-west-1",
+                    )
+        signer.assert_called_once_with(fake_credentials, "eu-west-1", "es")
+        create_connection.assert_called_once()
+        self.assertEqual(
+            create_connection.call_args.kwargs.get("connection_class"),
+            opensearch_module.RequestsHttpConnection,
+        )
+        self.assertEqual(create_connection.call_args.kwargs.get("http_auth"), "auth")
+
+    def testOpenSearchSigV4RejectsUnknownAuthType(self):
+        with self.assertRaises(opensearch_module.OpenSearchError):
+            opensearch_module.set_hosts(
+                "https://example.org:9200",
+                auth_type="kerberos",
+            )
+
+    def testOpenSearchSigV4RequiresAwsCredentials(self):
+        with patch.object(opensearch_module.boto3, "Session") as session_cls:
+            session_cls.return_value.get_credentials.return_value = None
+            with self.assertRaises(opensearch_module.OpenSearchError):
+                opensearch_module.set_hosts(
+                    "https://example.org:9200",
+                    auth_type="awssigv4",
+                    aws_region="eu-west-1",
+                )
+
+    @patch("parsedmarc.cli.opensearch.migrate_indexes")
+    @patch("parsedmarc.cli.opensearch.set_hosts")
+    @patch("parsedmarc.cli.get_dmarc_reports_from_mailbox")
+    @patch("parsedmarc.cli.IMAPConnection")
+    def testCliPassesOpenSearchSigV4Settings(
+        self,
+        mock_imap_connection,
+        mock_get_reports,
+        mock_set_hosts,
+        _mock_migrate_indexes,
+    ):
+        mock_imap_connection.return_value = object()
+        mock_get_reports.return_value = {
+            "aggregate_reports": [],
+            "failure_reports": [],
+            "smtp_tls_reports": [],
+        }
+
+        config = """[general]
+save_aggregate = true
+silent = true
+
+[imap]
+host = imap.example.com
+user = test-user
+password = test-password
+
+[opensearch]
+hosts = localhost
+authentication_type = awssigv4
+aws_region = eu-west-1
+aws_service = aoss
+"""
+        with tempfile.NamedTemporaryFile(
+            "w", suffix=".ini", delete=False
+        ) as config_file:
+            config_file.write(config)
+            config_path = config_file.name
+        self.addCleanup(lambda: os.path.exists(config_path) and os.remove(config_path))
+
+        with patch.object(sys, "argv", ["parsedmarc", "-c", config_path]):
+            parsedmarc.cli._main()
+
+        self.assertEqual(mock_set_hosts.call_args.kwargs.get("auth_type"), "awssigv4")
+        self.assertEqual(mock_set_hosts.call_args.kwargs.get("aws_region"), "eu-west-1")
+        self.assertEqual(mock_set_hosts.call_args.kwargs.get("aws_service"), "aoss")
+
+    @patch("parsedmarc.cli.elastic.save_aggregate_report_to_elasticsearch")
+    @patch("parsedmarc.cli.elastic.migrate_indexes")
+    @patch("parsedmarc.cli.elastic.set_hosts")
+    @patch("parsedmarc.cli.get_dmarc_reports_from_mailbox")
+    @patch("parsedmarc.cli.IMAPConnection")
+    def testFailOnOutputErrorExits(
+        self,
+        mock_imap_connection,
+        mock_get_reports,
+        _mock_set_hosts,
+        _mock_migrate_indexes,
+        mock_save_aggregate,
+    ):
+        """CLI should exit with code 1 when fail_on_output_error is enabled"""
+        mock_imap_connection.return_value = object()
+        mock_get_reports.return_value = {
+            "aggregate_reports": [{"policy_published": {"domain": "example.com"}}],
+            "failure_reports": [],
+            "smtp_tls_reports": [],
+        }
+        mock_save_aggregate.side_effect = parsedmarc.elastic.ElasticsearchError(
+            "simulated output failure"
+        )
+
+        config = """[general]
+save_aggregate = true
+fail_on_output_error = true
+silent = true
+
+[imap]
+host = imap.example.com
+user = test-user
+password = test-password
+
+[elasticsearch]
+hosts = localhost
+"""
+        with tempfile.NamedTemporaryFile(
+            "w", suffix=".ini", delete=False
+        ) as config_file:
+            config_file.write(config)
+            config_path = config_file.name
+        self.addCleanup(lambda: os.path.exists(config_path) and os.remove(config_path))
+
+        with patch.object(sys, "argv", ["parsedmarc", "-c", config_path]):
+            with self.assertRaises(SystemExit) as ctx:
+                parsedmarc.cli._main()
+
+        self.assertEqual(ctx.exception.code, 1)
+        mock_save_aggregate.assert_called_once()
+
+    @patch("parsedmarc.cli.elastic.save_aggregate_report_to_elasticsearch")
+    @patch("parsedmarc.cli.elastic.migrate_indexes")
+    @patch("parsedmarc.cli.elastic.set_hosts")
+    @patch("parsedmarc.cli.get_dmarc_reports_from_mailbox")
+    @patch("parsedmarc.cli.IMAPConnection")
+    def testOutputErrorDoesNotExitWhenDisabled(
+        self,
+        mock_imap_connection,
+        mock_get_reports,
+        _mock_set_hosts,
+        _mock_migrate_indexes,
+        mock_save_aggregate,
+    ):
+        mock_imap_connection.return_value = object()
+        mock_get_reports.return_value = {
+            "aggregate_reports": [{"policy_published": {"domain": "example.com"}}],
+            "failure_reports": [],
+            "smtp_tls_reports": [],
+        }
+        mock_save_aggregate.side_effect = parsedmarc.elastic.ElasticsearchError(
+            "simulated output failure"
+        )
+
+        config = """[general]
+save_aggregate = true
+fail_on_output_error = false
+silent = true
+
+[imap]
+host = imap.example.com
+user = test-user
+password = test-password
+
+[elasticsearch]
+hosts = localhost
+"""
+        with tempfile.NamedTemporaryFile(
+            "w", suffix=".ini", delete=False
+        ) as config_file:
+            config_file.write(config)
+            config_path = config_file.name
+        self.addCleanup(lambda: os.path.exists(config_path) and os.remove(config_path))
+
+        with patch.object(sys, "argv", ["parsedmarc", "-c", config_path]):
+            parsedmarc.cli._main()
+
+        mock_save_aggregate.assert_called_once()
+
+    @patch("parsedmarc.cli.opensearch.save_failure_report_to_opensearch")
+    @patch("parsedmarc.cli.opensearch.migrate_indexes")
+    @patch("parsedmarc.cli.opensearch.set_hosts")
+    @patch("parsedmarc.cli.elastic.save_failure_report_to_elasticsearch")
+    @patch("parsedmarc.cli.elastic.save_aggregate_report_to_elasticsearch")
+    @patch("parsedmarc.cli.elastic.migrate_indexes")
+    @patch("parsedmarc.cli.elastic.set_hosts")
+    @patch("parsedmarc.cli.get_dmarc_reports_from_mailbox")
+    @patch("parsedmarc.cli.IMAPConnection")
+    def testFailOnOutputErrorExitsWithMultipleSinkErrors(
+        self,
+        mock_imap_connection,
+        mock_get_reports,
+        _mock_es_set_hosts,
+        _mock_es_migrate,
+        mock_save_aggregate,
+        _mock_save_failure_elastic,
+        _mock_os_set_hosts,
+        _mock_os_migrate,
+        mock_save_failure_opensearch,
+    ):
+        mock_imap_connection.return_value = object()
+        mock_get_reports.return_value = {
+            "aggregate_reports": [{"policy_published": {"domain": "example.com"}}],
+            "failure_reports": [{"reported_domain": "example.com"}],
+            "smtp_tls_reports": [],
+        }
+        mock_save_aggregate.side_effect = parsedmarc.elastic.ElasticsearchError(
+            "aggregate sink failed"
+        )
+        mock_save_failure_opensearch.side_effect = (
+            parsedmarc.cli.opensearch.OpenSearchError("failure sink failed")
+        )
+
+        config = """[general]
+save_aggregate = true
+save_failure = true
+fail_on_output_error = true
+silent = true
+
+[imap]
+host = imap.example.com
+user = test-user
+password = test-password
+
+[elasticsearch]
+hosts = localhost
+
+[opensearch]
+hosts = localhost
+"""
+        with tempfile.NamedTemporaryFile(
+            "w", suffix=".ini", delete=False
+        ) as config_file:
+            config_file.write(config)
+            config_path = config_file.name
+        self.addCleanup(lambda: os.path.exists(config_path) and os.remove(config_path))
+
+        with patch.object(sys, "argv", ["parsedmarc", "-c", config_path]):
+            with self.assertRaises(SystemExit) as ctx:
+                parsedmarc.cli._main()
+
+        self.assertEqual(ctx.exception.code, 1)
+        mock_save_aggregate.assert_called_once()
+        mock_save_failure_opensearch.assert_called_once()
+
+    def test_resolve_section_key_simple(self):
+        """Simple section names resolve correctly."""
+        from parsedmarc.cli import _resolve_section_key
+
+        self.assertEqual(_resolve_section_key("IMAP_PASSWORD"), ("imap", "password"))
+        self.assertEqual(_resolve_section_key("GENERAL_DEBUG"), ("general", "debug"))
+        self.assertEqual(_resolve_section_key("S3_BUCKET"), ("s3", "bucket"))
+        self.assertEqual(_resolve_section_key("GELF_HOST"), ("gelf", "host"))
+
+    def test_resolve_section_key_underscore_sections(self):
+        """Multi-word section names (splunk_hec, gmail_api, etc.) resolve correctly."""
+        from parsedmarc.cli import _resolve_section_key
+
+        self.assertEqual(
+            _resolve_section_key("SPLUNK_HEC_TOKEN"), ("splunk_hec", "token")
+        )
+        self.assertEqual(
+            _resolve_section_key("GMAIL_API_CREDENTIALS_FILE"),
+            ("gmail_api", "credentials_file"),
+        )
+        self.assertEqual(
+            _resolve_section_key("LOG_ANALYTICS_CLIENT_ID"),
+            ("log_analytics", "client_id"),
+        )
+
+    def test_resolve_section_key_unknown(self):
+        """Unknown prefixes return (None, None)."""
+        from parsedmarc.cli import _resolve_section_key
+
+        self.assertEqual(_resolve_section_key("UNKNOWN_FOO"), (None, None))
+        # Just a section name with no key should not match
+        self.assertEqual(_resolve_section_key("IMAP"), (None, None))
+
+    def test_apply_env_overrides_injects_values(self):
+        """Env vars are injected into an existing ConfigParser."""
+        from configparser import ConfigParser
+        from parsedmarc.cli import _apply_env_overrides
+
+        config = ConfigParser()
+        config.add_section("imap")
+        config.set("imap", "host", "original.example.com")
+
+        env = {
+            "PARSEDMARC_IMAP_HOST": "new.example.com",
+            "PARSEDMARC_IMAP_PASSWORD": "secret123",
+        }
+        with patch.dict(os.environ, env, clear=False):
+            _apply_env_overrides(config)
+
+        self.assertEqual(config.get("imap", "host"), "new.example.com")
+        self.assertEqual(config.get("imap", "password"), "secret123")
+
+    def test_apply_env_overrides_creates_sections(self):
+        """Env vars create new sections when they don't exist."""
+        from configparser import ConfigParser
+        from parsedmarc.cli import _apply_env_overrides
+
+        config = ConfigParser()
+
+        env = {"PARSEDMARC_ELASTICSEARCH_HOSTS": "http://localhost:9200"}
+        with patch.dict(os.environ, env, clear=False):
+            _apply_env_overrides(config)
+
+        self.assertTrue(config.has_section("elasticsearch"))
+        self.assertEqual(config.get("elasticsearch", "hosts"), "http://localhost:9200")
+
+    def test_apply_env_overrides_ignores_config_file_var(self):
+        """PARSEDMARC_CONFIG_FILE is not injected as a config key."""
+        from configparser import ConfigParser
+        from parsedmarc.cli import _apply_env_overrides
+
+        config = ConfigParser()
+
+        env = {"PARSEDMARC_CONFIG_FILE": "/some/path.ini"}
+        with patch.dict(os.environ, env, clear=False):
+            _apply_env_overrides(config)
+
+        self.assertEqual(config.sections(), [])
+
+    def test_load_config_with_file_and_env_override(self):
+        """Env vars override values from an INI file."""
+        from parsedmarc.cli import _load_config
+
+        with NamedTemporaryFile(mode="w", suffix=".ini", delete=False) as f:
+            f.write(
+                "[imap]\nhost = file.example.com\nuser = alice\npassword = fromfile\n"
+            )
+            f.flush()
+            config_path = f.name
+
+        try:
+            env = {"PARSEDMARC_IMAP_PASSWORD": "fromenv"}
+            with patch.dict(os.environ, env, clear=False):
+                config = _load_config(config_path)
+
+            self.assertEqual(config.get("imap", "host"), "file.example.com")
+            self.assertEqual(config.get("imap", "user"), "alice")
+            self.assertEqual(config.get("imap", "password"), "fromenv")
+        finally:
+            os.unlink(config_path)
+
+    def test_load_config_env_only(self):
+        """Config can be loaded purely from env vars with no file."""
+        from parsedmarc.cli import _load_config
+
+        env = {
+            "PARSEDMARC_GENERAL_DEBUG": "true",
+            "PARSEDMARC_ELASTICSEARCH_HOSTS": "http://localhost:9200",
+        }
+        with patch.dict(os.environ, env, clear=False):
+            config = _load_config(None)
+
+        self.assertEqual(config.get("general", "debug"), "true")
+        self.assertEqual(config.get("elasticsearch", "hosts"), "http://localhost:9200")
+
+    def test_parse_config_from_env(self):
+        """Full round-trip: env vars -> ConfigParser -> opts."""
+        from argparse import Namespace
+        from parsedmarc.cli import _load_config, _parse_config
+
+        env = {
+            "PARSEDMARC_GENERAL_DEBUG": "true",
+            "PARSEDMARC_GENERAL_SAVE_AGGREGATE": "true",
+            "PARSEDMARC_GENERAL_OFFLINE": "true",
+        }
+        with patch.dict(os.environ, env, clear=False):
+            config = _load_config(None)
+
+        opts = Namespace()
+        _parse_config(config, opts)
+
+        self.assertTrue(opts.debug)
+        self.assertTrue(opts.save_aggregate)
+        self.assertTrue(opts.offline)
+
+    def test_config_file_env_var(self):
+        """PARSEDMARC_CONFIG_FILE env var specifies the config file path."""
+        from argparse import Namespace
+        from parsedmarc.cli import _load_config, _parse_config
+
+        with NamedTemporaryFile(mode="w", suffix=".ini", delete=False) as f:
+            f.write("[general]\ndebug = true\noffline = true\n")
+            f.flush()
+            config_path = f.name
+
+        try:
+            env = {"PARSEDMARC_CONFIG_FILE": config_path}
+            with patch.dict(os.environ, env, clear=False):
+                config = _load_config(os.environ.get("PARSEDMARC_CONFIG_FILE"))
+
+            opts = Namespace()
+            _parse_config(config, opts)
+            self.assertTrue(opts.debug)
+            self.assertTrue(opts.offline)
+        finally:
+            os.unlink(config_path)
+
+    def test_boolean_values_from_env(self):
+        """Various boolean string representations work through ConfigParser."""
+        from configparser import ConfigParser
+        from parsedmarc.cli import _apply_env_overrides
+
+        for true_val in ("true", "yes", "1", "on", "True", "YES"):
+            config = ConfigParser()
+            env = {"PARSEDMARC_GENERAL_DEBUG": true_val}
+            with patch.dict(os.environ, env, clear=False):
+                _apply_env_overrides(config)
+            self.assertTrue(
+                config.getboolean("general", "debug"),
+                f"Expected truthy for {true_val!r}",
+            )
+
+        for false_val in ("false", "no", "0", "off", "False", "NO"):
+            config = ConfigParser()
+            env = {"PARSEDMARC_GENERAL_DEBUG": false_val}
+            with patch.dict(os.environ, env, clear=False):
+                _apply_env_overrides(config)
+            self.assertFalse(
+                config.getboolean("general", "debug"),
+                f"Expected falsy for {false_val!r}",
+            )
+
+
+class TestGmailAuthModes(unittest.TestCase):
+    @patch("parsedmarc.cli.get_dmarc_reports_from_mailbox")
+    @patch("parsedmarc.cli.GmailConnection")
+    def testCliPassesGmailServiceAccountAuthSettings(
+        self, mock_gmail_connection, mock_get_mailbox_reports
+    ):
+        mock_gmail_connection.return_value = MagicMock()
+        mock_get_mailbox_reports.return_value = {
+            "aggregate_reports": [],
+            "failure_reports": [],
+            "smtp_tls_reports": [],
+        }
+        config = """[general]
+silent = true
+
+[gmail_api]
+credentials_file = /tmp/service-account.json
+auth_mode = service_account
+service_account_user = dmarc@example.com
+scopes = https://www.googleapis.com/auth/gmail.modify
+"""
+        with tempfile.NamedTemporaryFile("w", suffix=".ini", delete=False) as cfg_file:
+            cfg_file.write(config)
+            config_path = cfg_file.name
+        self.addCleanup(lambda: os.path.exists(config_path) and os.remove(config_path))
+
+        with patch.object(sys, "argv", ["parsedmarc", "-c", config_path]):
+            parsedmarc.cli._main()
+
+        self.assertEqual(
+            mock_gmail_connection.call_args.kwargs.get("auth_mode"), "service_account"
+        )
+        self.assertEqual(
+            mock_gmail_connection.call_args.kwargs.get("service_account_user"),
+            "dmarc@example.com",
+        )
+
+    @patch("parsedmarc.cli.get_dmarc_reports_from_mailbox")
+    @patch("parsedmarc.cli.GmailConnection")
+    def testCliAcceptsDelegatedUserAlias(self, mock_gmail_connection, mock_get_reports):
+        mock_gmail_connection.return_value = MagicMock()
+        mock_get_reports.return_value = {
+            "aggregate_reports": [],
+            "failure_reports": [],
+            "smtp_tls_reports": [],
+        }
+        config = """[general]
+silent = true
+
+[gmail_api]
+credentials_file = /tmp/service-account.json
+auth_mode = service_account
+delegated_user = delegated@example.com
+scopes = https://www.googleapis.com/auth/gmail.modify
+"""
+        with tempfile.NamedTemporaryFile("w", suffix=".ini", delete=False) as cfg_file:
+            cfg_file.write(config)
+            config_path = cfg_file.name
+        self.addCleanup(lambda: os.path.exists(config_path) and os.remove(config_path))
+
+        with patch.object(sys, "argv", ["parsedmarc", "-c", config_path]):
+            parsedmarc.cli._main()
+
+        self.assertEqual(
+            mock_gmail_connection.call_args.kwargs.get("service_account_user"),
+            "delegated@example.com",
+        )
+
+
+class TestMailboxWatchSince(unittest.TestCase):
+    def setUp(self):
+        from parsedmarc.log import logger as _logger
+
+        _logger.disabled = True
+        self._stdout_patch = patch("sys.stdout", new_callable=io.StringIO)
+        self._stderr_patch = patch("sys.stderr", new_callable=io.StringIO)
+        self._stdout_patch.start()
+        self._stderr_patch.start()
+
+    def tearDown(self):
+        from parsedmarc.log import logger as _logger
+
+        _logger.disabled = False
+        self._stderr_patch.stop()
+        self._stdout_patch.stop()
+
+    def testWatchInboxPassesSinceToMailboxFetch(self):
+        mailbox_connection = SimpleNamespace()
+
+        def fake_watch(check_callback, check_timeout, config_reloading=None):
+            check_callback(mailbox_connection)
+            raise _BreakLoop()
+
+        mailbox_connection.watch = fake_watch
+        callback = MagicMock()
+        with patch.object(
+            parsedmarc, "get_dmarc_reports_from_mailbox", return_value={}
+        ) as mocked:
+            with self.assertRaises(_BreakLoop):
+                parsedmarc.watch_inbox(
+                    mailbox_connection=cast(
+                        parsedmarc.MailboxConnection, mailbox_connection
+                    ),
+                    callback=callback,
+                    check_timeout=1,
+                    batch_size=10,
+                    since="1d",
+                )
+        self.assertEqual(mocked.call_args.kwargs.get("since"), "1d")
+
+    @patch("parsedmarc.cli.get_dmarc_reports_from_mailbox")
+    @patch("parsedmarc.cli.watch_inbox")
+    @patch("parsedmarc.cli.IMAPConnection")
+    def testCliPassesSinceToWatchInbox(
+        self, mock_imap_connection, mock_watch_inbox, mock_get_mailbox_reports
+    ):
+        mock_imap_connection.return_value = object()
+        mock_get_mailbox_reports.return_value = {
+            "aggregate_reports": [],
+            "failure_reports": [],
+            "smtp_tls_reports": [],
+        }
+        mock_watch_inbox.side_effect = FileExistsError("stop-watch-loop")
+
+        config_text = """[general]
+silent = true
+
+[imap]
+host = imap.example.com
+user = user
+password = pass
+
+[mailbox]
+watch = true
+since = 2d
+"""
+
+        with tempfile.NamedTemporaryFile("w", suffix=".ini", delete=False) as cfg:
+            cfg.write(config_text)
+            cfg_path = cfg.name
+        self.addCleanup(lambda: os.path.exists(cfg_path) and os.remove(cfg_path))
+
+        with patch.object(sys, "argv", ["parsedmarc", "-c", cfg_path]):
+            with self.assertRaises(SystemExit) as system_exit:
+                parsedmarc.cli._main()
+
+        self.assertEqual(system_exit.exception.code, 1)
+        self.assertEqual(mock_watch_inbox.call_args.kwargs.get("since"), "2d")
+
+
+class TestMailboxPerformance(unittest.TestCase):
+    def setUp(self):
+        from parsedmarc.log import logger as _logger
+
+        _logger.disabled = True
+        self._stdout_patch = patch("sys.stdout", new_callable=io.StringIO)
+        self._stderr_patch = patch("sys.stderr", new_callable=io.StringIO)
+        self._stdout_patch.start()
+        self._stderr_patch.start()
+
+    def tearDown(self):
+        from parsedmarc.log import logger as _logger
+
+        _logger.disabled = False
+        self._stderr_patch.stop()
+        self._stdout_patch.stop()
+
+    def testBatchModeAvoidsExtraFullFetch(self):
+        connection = _DummyMailboxConnection()
+        parsedmarc.get_dmarc_reports_from_mailbox(
+            connection=connection,
+            reports_folder="INBOX",
+            test=True,
+            batch_size=10,
+            create_folders=False,
+        )
+        self.assertEqual(len(connection.fetch_calls), 1)
+
+    @patch("parsedmarc.cli.get_dmarc_reports_from_mailbox")
+    @patch("parsedmarc.cli.MSGraphConnection")
+    def testCliPassesMsGraphCertificateAuthSettings(
+        self, mock_graph_connection, mock_get_mailbox_reports
+    ):
+        mock_graph_connection.return_value = object()
+        mock_get_mailbox_reports.return_value = {
+            "aggregate_reports": [],
+            "failure_reports": [],
+            "smtp_tls_reports": [],
+        }
+
+        config_text = """[general]
+silent = true
+
+[msgraph]
+auth_method = Certificate
+client_id = client-id
+tenant_id = tenant-id
+mailbox = shared@example.com
+certificate_path = /tmp/msgraph-cert.pem
+certificate_password = cert-pass
+"""
+
+        with tempfile.NamedTemporaryFile("w", suffix=".ini", delete=False) as cfg:
+            cfg.write(config_text)
+            cfg_path = cfg.name
+        self.addCleanup(lambda: os.path.exists(cfg_path) and os.remove(cfg_path))
+
+        with patch.object(sys, "argv", ["parsedmarc", "-c", cfg_path]):
+            parsedmarc.cli._main()
+
+        self.assertEqual(
+            mock_graph_connection.call_args.kwargs.get("auth_method"), "Certificate"
+        )
+        self.assertEqual(
+            mock_graph_connection.call_args.kwargs.get("certificate_path"),
+            "/tmp/msgraph-cert.pem",
+        )
+        self.assertEqual(
+            mock_graph_connection.call_args.kwargs.get("certificate_password"),
+            "cert-pass",
+        )
+
+    @patch("parsedmarc.cli.get_dmarc_reports_from_mailbox")
+    @patch("parsedmarc.cli.MSGraphConnection")
+    @patch("parsedmarc.cli.logger")
+    def testCliRequiresMsGraphCertificatePath(
+        self, mock_logger, mock_graph_connection, mock_get_mailbox_reports
+    ):
+        config_text = """[general]
+silent = true
+
+[msgraph]
+auth_method = Certificate
+client_id = client-id
+tenant_id = tenant-id
+mailbox = shared@example.com
+"""
+
+        with tempfile.NamedTemporaryFile("w", suffix=".ini", delete=False) as cfg:
+            cfg.write(config_text)
+            cfg_path = cfg.name
+        self.addCleanup(lambda: os.path.exists(cfg_path) and os.remove(cfg_path))
+
+        with patch.object(sys, "argv", ["parsedmarc", "-c", cfg_path]):
+            with self.assertRaises(SystemExit) as system_exit:
+                parsedmarc.cli._main()
+
+        self.assertEqual(system_exit.exception.code, -1)
+        mock_logger.critical.assert_called_once_with(
+            "certificate_path setting missing from the msgraph config section"
+        )
+        mock_graph_connection.assert_not_called()
+        mock_get_mailbox_reports.assert_not_called()
+
+    @patch("parsedmarc.cli.get_dmarc_reports_from_mailbox")
+    @patch("parsedmarc.cli.MSGraphConnection")
+    def testCliUsesMsGraphUserAsMailboxForUsernamePasswordAuth(
+        self, mock_graph_connection, mock_get_mailbox_reports
+    ):
+        mock_graph_connection.return_value = object()
+        mock_get_mailbox_reports.return_value = {
+            "aggregate_reports": [],
+            "failure_reports": [],
+            "smtp_tls_reports": [],
+        }
+
+        config_text = """[general]
+silent = true
+
+[msgraph]
+auth_method = UsernamePassword
+client_id = client-id
+client_secret = client-secret
+user = owner@example.com
+password = test-password
+"""
+
+        with tempfile.NamedTemporaryFile("w", suffix=".ini", delete=False) as cfg:
+            cfg.write(config_text)
+            cfg_path = cfg.name
+        self.addCleanup(lambda: os.path.exists(cfg_path) and os.remove(cfg_path))
+
+        with patch.object(sys, "argv", ["parsedmarc", "-c", cfg_path]):
+            parsedmarc.cli._main()
+
+        self.assertEqual(
+            mock_graph_connection.call_args.kwargs.get("mailbox"),
+            "owner@example.com",
+        )
+        self.assertEqual(
+            mock_graph_connection.call_args.kwargs.get("username"),
+            "owner@example.com",
+        )
+
+    @patch("parsedmarc.cli.get_dmarc_reports_from_mailbox")
+    @patch("parsedmarc.cli.MSGraphConnection")
+    @patch("parsedmarc.cli.logger")
+    def testCliRequiresMsGraphPasswordForUsernamePasswordAuth(
+        self, mock_logger, mock_graph_connection, mock_get_mailbox_reports
+    ):
+        config_text = """[general]
+silent = true
+
+[msgraph]
+auth_method = UsernamePassword
+client_id = client-id
+client_secret = client-secret
+user = owner@example.com
+"""
+
+        with tempfile.NamedTemporaryFile("w", suffix=".ini", delete=False) as cfg:
+            cfg.write(config_text)
+            cfg_path = cfg.name
+        self.addCleanup(lambda: os.path.exists(cfg_path) and os.remove(cfg_path))
+
+        with patch.object(sys, "argv", ["parsedmarc", "-c", cfg_path]):
+            with self.assertRaises(SystemExit) as system_exit:
+                parsedmarc.cli._main()
+
+        self.assertEqual(system_exit.exception.code, -1)
+        mock_logger.critical.assert_called_once_with(
+            "password setting missing from the msgraph config section"
+        )
+        mock_graph_connection.assert_not_called()
+        mock_get_mailbox_reports.assert_not_called()
+
+
+class TestMSGraphCliValidation(unittest.TestCase):
+    @patch("parsedmarc.cli.get_dmarc_reports_from_mailbox")
+    @patch("parsedmarc.cli.MSGraphConnection")
+    def testCliPassesMsGraphClientSecretAuthSettings(
+        self, mock_graph_connection, mock_get_mailbox_reports
+    ):
+        mock_graph_connection.return_value = object()
+        mock_get_mailbox_reports.return_value = {
+            "aggregate_reports": [],
+            "failure_reports": [],
+            "smtp_tls_reports": [],
+        }
+
+        config_text = """[general]
+silent = true
+
+[msgraph]
+auth_method = ClientSecret
+client_id = client-id
+client_secret = client-secret
+tenant_id = tenant-id
+mailbox = shared@example.com
+"""
+
+        with tempfile.NamedTemporaryFile("w", suffix=".ini", delete=False) as cfg:
+            cfg.write(config_text)
+            cfg_path = cfg.name
+        self.addCleanup(lambda: os.path.exists(cfg_path) and os.remove(cfg_path))
+
+        with patch.object(sys, "argv", ["parsedmarc", "-c", cfg_path]):
+            parsedmarc.cli._main()
+
+        self.assertEqual(
+            mock_graph_connection.call_args.kwargs.get("auth_method"), "ClientSecret"
+        )
+        self.assertEqual(
+            mock_graph_connection.call_args.kwargs.get("client_secret"),
+            "client-secret",
+        )
+        self.assertEqual(
+            mock_graph_connection.call_args.kwargs.get("tenant_id"), "tenant-id"
+        )
+        self.assertEqual(
+            mock_graph_connection.call_args.kwargs.get("mailbox"),
+            "shared@example.com",
+        )
+
+    @patch("parsedmarc.cli.get_dmarc_reports_from_mailbox")
+    @patch("parsedmarc.cli.MSGraphConnection")
+    @patch("parsedmarc.cli.logger")
+    def testCliRequiresMsGraphClientSecretForClientSecretAuth(
+        self, mock_logger, mock_graph_connection, mock_get_mailbox_reports
+    ):
+        config_text = """[general]
+silent = true
+
+[msgraph]
+auth_method = ClientSecret
+client_id = client-id
+tenant_id = tenant-id
+mailbox = shared@example.com
+"""
+
+        with tempfile.NamedTemporaryFile("w", suffix=".ini", delete=False) as cfg:
+            cfg.write(config_text)
+            cfg_path = cfg.name
+        self.addCleanup(lambda: os.path.exists(cfg_path) and os.remove(cfg_path))
+
+        with patch.object(sys, "argv", ["parsedmarc", "-c", cfg_path]):
+            with self.assertRaises(SystemExit) as system_exit:
+                parsedmarc.cli._main()
+
+        self.assertEqual(system_exit.exception.code, -1)
+        mock_logger.critical.assert_called_once_with(
+            "client_secret setting missing from the msgraph config section"
+        )
+        mock_graph_connection.assert_not_called()
+        mock_get_mailbox_reports.assert_not_called()
+
+    @patch("parsedmarc.cli.get_dmarc_reports_from_mailbox")
+    @patch("parsedmarc.cli.MSGraphConnection")
+    @patch("parsedmarc.cli.logger")
+    def testCliRequiresMsGraphTenantIdForClientSecretAuth(
+        self, mock_logger, mock_graph_connection, mock_get_mailbox_reports
+    ):
+        config_text = """[general]
+silent = true
+
+[msgraph]
+auth_method = ClientSecret
+client_id = client-id
+client_secret = client-secret
+mailbox = shared@example.com
+"""
+
+        with tempfile.NamedTemporaryFile("w", suffix=".ini", delete=False) as cfg:
+            cfg.write(config_text)
+            cfg_path = cfg.name
+        self.addCleanup(lambda: os.path.exists(cfg_path) and os.remove(cfg_path))
+
+        with patch.object(sys, "argv", ["parsedmarc", "-c", cfg_path]):
+            with self.assertRaises(SystemExit) as system_exit:
+                parsedmarc.cli._main()
+
+        self.assertEqual(system_exit.exception.code, -1)
+        mock_logger.critical.assert_called_once_with(
+            "tenant_id setting missing from the msgraph config section"
+        )
+        mock_graph_connection.assert_not_called()
+        mock_get_mailbox_reports.assert_not_called()
+
+    @patch("parsedmarc.cli.get_dmarc_reports_from_mailbox")
+    @patch("parsedmarc.cli.MSGraphConnection")
+    @patch("parsedmarc.cli.logger")
+    def testCliRequiresMsGraphMailboxForClientSecretAuth(
+        self, mock_logger, mock_graph_connection, mock_get_mailbox_reports
+    ):
+        config_text = """[general]
+silent = true
+
+[msgraph]
+auth_method = ClientSecret
+client_id = client-id
+client_secret = client-secret
+tenant_id = tenant-id
+"""
+
+        with tempfile.NamedTemporaryFile("w", suffix=".ini", delete=False) as cfg:
+            cfg.write(config_text)
+            cfg_path = cfg.name
+        self.addCleanup(lambda: os.path.exists(cfg_path) and os.remove(cfg_path))
+
+        with patch.object(sys, "argv", ["parsedmarc", "-c", cfg_path]):
+            with self.assertRaises(SystemExit) as system_exit:
+                parsedmarc.cli._main()
+
+        self.assertEqual(system_exit.exception.code, -1)
+        mock_logger.critical.assert_called_once_with(
+            "mailbox setting missing from the msgraph config section"
+        )
+        mock_graph_connection.assert_not_called()
+        mock_get_mailbox_reports.assert_not_called()
+
+    @patch("parsedmarc.cli.get_dmarc_reports_from_mailbox")
+    @patch("parsedmarc.cli.MSGraphConnection")
+    def testCliAllowsMsGraphDeviceCodeWithoutUser(
+        self, mock_graph_connection, mock_get_mailbox_reports
+    ):
+        mock_graph_connection.return_value = object()
+        mock_get_mailbox_reports.return_value = {
+            "aggregate_reports": [],
+            "failure_reports": [],
+            "smtp_tls_reports": [],
+        }
+
+        config_text = """[general]
+silent = true
+
+[msgraph]
+auth_method = DeviceCode
+client_id = client-id
+tenant_id = tenant-id
+mailbox = shared@example.com
+"""
+
+        with tempfile.NamedTemporaryFile("w", suffix=".ini", delete=False) as cfg:
+            cfg.write(config_text)
+            cfg_path = cfg.name
+        self.addCleanup(lambda: os.path.exists(cfg_path) and os.remove(cfg_path))
+
+        with patch.object(sys, "argv", ["parsedmarc", "-c", cfg_path]):
+            parsedmarc.cli._main()
+
+        self.assertEqual(
+            mock_graph_connection.call_args.kwargs.get("auth_method"), "DeviceCode"
+        )
+        self.assertEqual(
+            mock_graph_connection.call_args.kwargs.get("mailbox"),
+            "shared@example.com",
+        )
+        self.assertIsNone(mock_graph_connection.call_args.kwargs.get("username"))
+
+    @patch("parsedmarc.cli.get_dmarc_reports_from_mailbox")
+    @patch("parsedmarc.cli.MSGraphConnection")
+    @patch("parsedmarc.cli.logger")
+    def testCliRequiresMsGraphTenantIdForDeviceCodeAuth(
+        self, mock_logger, mock_graph_connection, mock_get_mailbox_reports
+    ):
+        config_text = """[general]
+silent = true
+
+[msgraph]
+auth_method = DeviceCode
+client_id = client-id
+mailbox = shared@example.com
+"""
+
+        with tempfile.NamedTemporaryFile("w", suffix=".ini", delete=False) as cfg:
+            cfg.write(config_text)
+            cfg_path = cfg.name
+        self.addCleanup(lambda: os.path.exists(cfg_path) and os.remove(cfg_path))
+
+        with patch.object(sys, "argv", ["parsedmarc", "-c", cfg_path]):
+            with self.assertRaises(SystemExit) as system_exit:
+                parsedmarc.cli._main()
+
+        self.assertEqual(system_exit.exception.code, -1)
+        mock_logger.critical.assert_called_once_with(
+            "tenant_id setting missing from the msgraph config section"
+        )
+        mock_graph_connection.assert_not_called()
+        mock_get_mailbox_reports.assert_not_called()
+
+    @patch("parsedmarc.cli.get_dmarc_reports_from_mailbox")
+    @patch("parsedmarc.cli.MSGraphConnection")
+    @patch("parsedmarc.cli.logger")
+    def testCliRequiresMsGraphMailboxForDeviceCodeAuth(
+        self, mock_logger, mock_graph_connection, mock_get_mailbox_reports
+    ):
+        config_text = """[general]
+silent = true
+
+[msgraph]
+auth_method = DeviceCode
+client_id = client-id
+tenant_id = tenant-id
+"""
+
+        with tempfile.NamedTemporaryFile("w", suffix=".ini", delete=False) as cfg:
+            cfg.write(config_text)
+            cfg_path = cfg.name
+        self.addCleanup(lambda: os.path.exists(cfg_path) and os.remove(cfg_path))
+
+        with patch.object(sys, "argv", ["parsedmarc", "-c", cfg_path]):
+            with self.assertRaises(SystemExit) as system_exit:
+                parsedmarc.cli._main()
+
+        self.assertEqual(system_exit.exception.code, -1)
+        mock_logger.critical.assert_called_once_with(
+            "mailbox setting missing from the msgraph config section"
+        )
+        mock_graph_connection.assert_not_called()
+        mock_get_mailbox_reports.assert_not_called()
+
+    @patch("parsedmarc.cli.get_dmarc_reports_from_mailbox")
+    @patch("parsedmarc.cli.MSGraphConnection")
+    @patch("parsedmarc.cli.logger")
+    def testCliRequiresMsGraphTenantIdForCertificateAuth(
+        self, mock_logger, mock_graph_connection, mock_get_mailbox_reports
+    ):
+        config_text = """[general]
+silent = true
+
+[msgraph]
+auth_method = Certificate
+client_id = client-id
+mailbox = shared@example.com
+certificate_path = /tmp/msgraph-cert.pem
+"""
+
+        with tempfile.NamedTemporaryFile("w", suffix=".ini", delete=False) as cfg:
+            cfg.write(config_text)
+            cfg_path = cfg.name
+        self.addCleanup(lambda: os.path.exists(cfg_path) and os.remove(cfg_path))
+
+        with patch.object(sys, "argv", ["parsedmarc", "-c", cfg_path]):
+            with self.assertRaises(SystemExit) as system_exit:
+                parsedmarc.cli._main()
+
+        self.assertEqual(system_exit.exception.code, -1)
+        mock_logger.critical.assert_called_once_with(
+            "tenant_id setting missing from the msgraph config section"
+        )
+        mock_graph_connection.assert_not_called()
+        mock_get_mailbox_reports.assert_not_called()
+
+    @patch("parsedmarc.cli.get_dmarc_reports_from_mailbox")
+    @patch("parsedmarc.cli.MSGraphConnection")
+    @patch("parsedmarc.cli.logger")
+    def testCliRequiresMsGraphMailboxForCertificateAuth(
+        self, mock_logger, mock_graph_connection, mock_get_mailbox_reports
+    ):
+        config_text = """[general]
+silent = true
+
+[msgraph]
+auth_method = Certificate
+client_id = client-id
+tenant_id = tenant-id
+certificate_path = /tmp/msgraph-cert.pem
+"""
+
+        with tempfile.NamedTemporaryFile("w", suffix=".ini", delete=False) as cfg:
+            cfg.write(config_text)
+            cfg_path = cfg.name
+        self.addCleanup(lambda: os.path.exists(cfg_path) and os.remove(cfg_path))
+
+        with patch.object(sys, "argv", ["parsedmarc", "-c", cfg_path]):
+            with self.assertRaises(SystemExit) as system_exit:
+                parsedmarc.cli._main()
+
+        self.assertEqual(system_exit.exception.code, -1)
+        mock_logger.critical.assert_called_once_with(
+            "mailbox setting missing from the msgraph config section"
+        )
+        mock_graph_connection.assert_not_called()
+        mock_get_mailbox_reports.assert_not_called()
+
+
+class TestSighupReload(unittest.TestCase):
+    """Tests for SIGHUP-driven configuration reload in watch mode."""
+
+    def setUp(self):
+        from parsedmarc.log import logger as _logger
+
+        _logger.disabled = True
+        self._stdout_patch = patch("sys.stdout", new_callable=io.StringIO)
+        self._stderr_patch = patch("sys.stderr", new_callable=io.StringIO)
+        self._stdout_patch.start()
+        self._stderr_patch.start()
+
+    def tearDown(self):
+        from parsedmarc.log import logger as _logger
+
+        _logger.disabled = False
+        self._stderr_patch.stop()
+        self._stdout_patch.stop()
+
+    _BASE_CONFIG = """[general]
+silent = true
+
+[imap]
+host = imap.example.com
+user = user
+password = pass
+
+[mailbox]
+watch = true
+"""
+
+    @unittest.skipUnless(
+        hasattr(signal, "SIGHUP"),
+        "SIGHUP not available on this platform",
+    )
+    @patch("parsedmarc.cli._init_output_clients")
+    @patch("parsedmarc.cli._parse_config")
+    @patch("parsedmarc.cli._load_config")
+    @patch("parsedmarc.cli.get_dmarc_reports_from_mailbox")
+    @patch("parsedmarc.cli.watch_inbox")
+    @patch("parsedmarc.cli.IMAPConnection")
+    def testSighupTriggersReloadAndWatchRestarts(
+        self,
+        mock_imap,
+        mock_watch,
+        mock_get_reports,
+        mock_load_config,
+        mock_parse_config,
+        mock_init_clients,
+    ):
+        """SIGHUP causes watch to return, config is re-parsed, and watch restarts."""
+        import signal as signal_module
+
+        mock_imap.return_value = object()
+        mock_get_reports.return_value = {
+            "aggregate_reports": [],
+            "failure_reports": [],
+            "smtp_tls_reports": [],
+        }
+
+        mock_load_config.return_value = ConfigParser()
+
+        def parse_side_effect(config, opts):
+            opts.imap_host = "imap.example.com"
+            opts.imap_user = "user"
+            opts.imap_password = "pass"
+            opts.mailbox_watch = True
+            return None
+
+        mock_parse_config.side_effect = parse_side_effect
+        mock_init_clients.return_value = {}
+
+        call_count = [0]
+
+        def watch_side_effect(*args, **kwargs):
+            call_count[0] += 1
+            if call_count[0] == 1:
+                # Simulate SIGHUP arriving while watch is running
+                if hasattr(signal_module, "SIGHUP"):
+                    import os
+
+                    os.kill(os.getpid(), signal_module.SIGHUP)
+                return  # Normal return — reload loop will continue
+            else:
+                raise FileExistsError("stop-watch-loop")
+
+        mock_watch.side_effect = watch_side_effect
+
+        with tempfile.NamedTemporaryFile("w", suffix=".ini", delete=False) as cfg:
+            cfg.write(self._BASE_CONFIG)
+            cfg_path = cfg.name
+        self.addCleanup(lambda: os.path.exists(cfg_path) and os.remove(cfg_path))
+
+        with patch.object(sys, "argv", ["parsedmarc", "-c", cfg_path]):
+            with self.assertRaises(SystemExit) as cm:
+                parsedmarc.cli._main()
+
+        # Exited with code 1 (from FileExistsError handler)
+        self.assertEqual(cm.exception.code, 1)
+        # watch_inbox was called twice: initial run + after reload
+        self.assertEqual(mock_watch.call_count, 2)
+        # _parse_config called for initial load + reload
+        self.assertGreaterEqual(mock_parse_config.call_count, 2)
+
+    @unittest.skipUnless(
+        hasattr(signal, "SIGHUP"),
+        "SIGHUP not available on this platform",
+    )
+    @patch("parsedmarc.cli._init_output_clients")
+    @patch("parsedmarc.cli._parse_config")
+    @patch("parsedmarc.cli._load_config")
+    @patch("parsedmarc.cli.get_dmarc_reports_from_mailbox")
+    @patch("parsedmarc.cli.watch_inbox")
+    @patch("parsedmarc.cli.IMAPConnection")
+    def testInvalidConfigOnReloadKeepsPreviousState(
+        self,
+        mock_imap,
+        mock_watch,
+        mock_get_reports,
+        mock_load_config,
+        mock_parse_config,
+        mock_init_clients,
+    ):
+        """A failing reload leaves opts and clients unchanged."""
+        import signal as signal_module
+
+        mock_imap.return_value = object()
+        mock_get_reports.return_value = {
+            "aggregate_reports": [],
+            "failure_reports": [],
+            "smtp_tls_reports": [],
+        }
+
+        mock_load_config.return_value = ConfigParser()
+
+        # Initial parse sets required opts; reload parse raises
+        initial_map = {"prefix_": ["example.com"]}
+        call_count = [0]
+
+        def parse_side_effect(config, opts):
+            call_count[0] += 1
+            opts.imap_host = "imap.example.com"
+            opts.imap_user = "user"
+            opts.imap_password = "pass"
+            opts.mailbox_watch = True
+            if call_count[0] == 1:
+                return initial_map
+            raise RuntimeError("bad config")
+
+        mock_parse_config.side_effect = parse_side_effect
+
+        initial_clients = {"s3_client": MagicMock()}
+        mock_init_clients.return_value = initial_clients
+
+        watch_calls = [0]
+
+        def watch_side_effect(*args, **kwargs):
+            watch_calls[0] += 1
+            if watch_calls[0] == 1:
+                if hasattr(signal_module, "SIGHUP"):
+                    import os
+
+                    os.kill(os.getpid(), signal_module.SIGHUP)
+                return
+            else:
+                raise FileExistsError("stop")
+
+        mock_watch.side_effect = watch_side_effect
+
+        with tempfile.NamedTemporaryFile("w", suffix=".ini", delete=False) as cfg:
+            cfg.write(self._BASE_CONFIG)
+            cfg_path = cfg.name
+        self.addCleanup(lambda: os.path.exists(cfg_path) and os.remove(cfg_path))
+
+        with patch.object(sys, "argv", ["parsedmarc", "-c", cfg_path]):
+            with self.assertRaises(SystemExit) as cm:
+                parsedmarc.cli._main()
+
+        self.assertEqual(cm.exception.code, 1)
+        # watch was still called twice (reload loop continued after failed reload)
+        self.assertEqual(mock_watch.call_count, 2)
+        # The failed reload must not have closed the original clients
+        initial_clients["s3_client"].close.assert_not_called()
+
+    @unittest.skipUnless(
+        hasattr(signal, "SIGHUP"),
+        "SIGHUP not available on this platform",
+    )
+    @patch("parsedmarc.cli._init_output_clients")
+    @patch("parsedmarc.cli._parse_config")
+    @patch("parsedmarc.cli._load_config")
+    @patch("parsedmarc.cli.get_dmarc_reports_from_mailbox")
+    @patch("parsedmarc.cli.watch_inbox")
+    @patch("parsedmarc.cli.IMAPConnection")
+    def testReloadClosesOldClients(
+        self,
+        mock_imap,
+        mock_watch,
+        mock_get_reports,
+        mock_load_config,
+        mock_parse_config,
+        mock_init_clients,
+    ):
+        """Successful reload closes the old output clients before replacing them."""
+        import signal as signal_module
+
+        mock_imap.return_value = object()
+        mock_get_reports.return_value = {
+            "aggregate_reports": [],
+            "failure_reports": [],
+            "smtp_tls_reports": [],
+        }
+
+        mock_load_config.return_value = ConfigParser()
+
+        def parse_side_effect(config, opts):
+            opts.imap_host = "imap.example.com"
+            opts.imap_user = "user"
+            opts.imap_password = "pass"
+            opts.mailbox_watch = True
+            return None
+
+        mock_parse_config.side_effect = parse_side_effect
+
+        old_client = MagicMock()
+        new_client = MagicMock()
+        init_call = [0]
+
+        def init_side_effect(opts):
+            init_call[0] += 1
+            if init_call[0] == 1:
+                return {"kafka_client": old_client}
+            return {"kafka_client": new_client}
+
+        mock_init_clients.side_effect = init_side_effect
+
+        watch_calls = [0]
+
+        def watch_side_effect(*args, **kwargs):
+            watch_calls[0] += 1
+            if watch_calls[0] == 1:
+                if hasattr(signal_module, "SIGHUP"):
+                    import os
+
+                    os.kill(os.getpid(), signal_module.SIGHUP)
+                return
+            else:
+                raise FileExistsError("stop")
+
+        mock_watch.side_effect = watch_side_effect
+
+        with tempfile.NamedTemporaryFile("w", suffix=".ini", delete=False) as cfg:
+            cfg.write(self._BASE_CONFIG)
+            cfg_path = cfg.name
+        self.addCleanup(lambda: os.path.exists(cfg_path) and os.remove(cfg_path))
+
+        with patch.object(sys, "argv", ["parsedmarc", "-c", cfg_path]):
+            with self.assertRaises(SystemExit):
+                parsedmarc.cli._main()
+
+        # Old client must have been closed when reload succeeded
+        old_client.close.assert_called_once()
+
+    @unittest.skipUnless(
+        hasattr(signal, "SIGHUP"),
+        "SIGHUP not available on this platform",
+    )
+    @patch("parsedmarc.cli._init_output_clients")
+    @patch("parsedmarc.cli.get_dmarc_reports_from_mailbox")
+    @patch("parsedmarc.cli.watch_inbox")
+    @patch("parsedmarc.cli.IMAPConnection")
+    def testRemovedConfigSectionTakesEffectOnReload(
+        self,
+        mock_imap,
+        mock_watch,
+        mock_get_reports,
+        mock_init_clients,
+    ):
+        """Removing a config section on reload resets its options to their defaults."""
+        import signal as signal_module
+
+        mock_imap.return_value = object()
+        mock_get_reports.return_value = {
+            "aggregate_reports": [],
+            "failure_reports": [],
+            "smtp_tls_reports": [],
+        }
+        mock_init_clients.return_value = {}
+
+        # First config sets kafka_hosts (with required topics); second removes it.
+        config_v1 = (
+            self._BASE_CONFIG
+            + "\n[kafka]\nhosts = kafka.example.com:9092\n"
+            + "aggregate_topic = dmarc_agg\n"
+            + "forensic_topic = dmarc_forensic\n"
+            + "smtp_tls_topic = smtp_tls\n"
+        )
+        config_v2 = self._BASE_CONFIG  # no [kafka] section
+
+        with tempfile.NamedTemporaryFile("w", suffix=".ini", delete=False) as cfg:
+            cfg.write(config_v1)
+            cfg_path = cfg.name
+        self.addCleanup(lambda: os.path.exists(cfg_path) and os.remove(cfg_path))
+
+        watch_calls = [0]
+
+        def watch_side_effect(*args, **kwargs):
+            watch_calls[0] += 1
+            if watch_calls[0] == 1:
+                # Rewrite config to remove kafka before triggering reload
+                with open(cfg_path, "w") as f:
+                    f.write(config_v2)
+                if hasattr(signal_module, "SIGHUP"):
+                    import os
+
+                    os.kill(os.getpid(), signal_module.SIGHUP)
+                return
+            else:
+                raise FileExistsError("stop")
+
+        mock_watch.side_effect = watch_side_effect
+
+        # Capture opts used on each _init_output_clients call
+        init_opts_captures = []
+
+        def init_side_effect(opts):
+            from argparse import Namespace as NS
+
+            init_opts_captures.append(NS(**vars(opts)))
+            return {}
+
+        mock_init_clients.side_effect = init_side_effect
+
+        with patch.object(sys, "argv", ["parsedmarc", "-c", cfg_path]):
+            with self.assertRaises(SystemExit):
+                parsedmarc.cli._main()
+
+        # First init: kafka_hosts should be set from v1 config
+        self.assertIsNotNone(init_opts_captures[0].kafka_hosts)
+        # Second init (after reload with v2 config): kafka_hosts should be None
+        self.assertIsNone(init_opts_captures[1].kafka_hosts)
+
+    @unittest.skipUnless(
+        hasattr(signal, "SIGHUP"),
+        "SIGHUP not available on this platform",
+    )
+    @patch("parsedmarc.cli._init_output_clients")
+    @patch("parsedmarc.cli._parse_config")
+    @patch("parsedmarc.cli._load_config")
+    @patch("parsedmarc.cli.get_dmarc_reports_from_mailbox")
+    @patch("parsedmarc.cli.watch_inbox")
+    @patch("parsedmarc.cli.IMAPConnection")
+    def testReloadRefreshesReverseDnsMap(
+        self,
+        mock_imap,
+        mock_watch,
+        mock_get_reports,
+        mock_load_config,
+        mock_parse_config,
+        mock_init_clients,
+    ):
+        """SIGHUP reload repopulates the reverse DNS map so lookups still work."""
+        import signal as signal_module
+
+        from parsedmarc import REVERSE_DNS_MAP
+
+        mock_imap.return_value = object()
+        mock_get_reports.return_value = {
+            "aggregate_reports": [],
+            "failure_reports": [],
+            "smtp_tls_reports": [],
+        }
+
+        mock_load_config.return_value = ConfigParser()
+
+        def parse_side_effect(config, opts):
+            opts.imap_host = "imap.example.com"
+            opts.imap_user = "user"
+            opts.imap_password = "pass"
+            opts.mailbox_watch = True
+            return None
+
+        mock_parse_config.side_effect = parse_side_effect
+        mock_init_clients.return_value = {}
+
+        # Snapshot the map state on the second watch_inbox call (after reload)
+        map_snapshots = []
+
+        watch_calls = [0]
+
+        def watch_side_effect(*args, **kwargs):
+            watch_calls[0] += 1
+            if watch_calls[0] == 1:
+                if hasattr(signal_module, "SIGHUP"):
+                    import os
+
+                    os.kill(os.getpid(), signal_module.SIGHUP)
+                return
+            else:
+                # Capture the map state after reload, before we stop the loop
+                map_snapshots.append(dict(REVERSE_DNS_MAP))
+                raise FileExistsError("stop")
+
+        mock_watch.side_effect = watch_side_effect
+
+        with tempfile.NamedTemporaryFile("w", suffix=".ini", delete=False) as cfg:
+            cfg.write(self._BASE_CONFIG)
+            cfg_path = cfg.name
+        self.addCleanup(lambda: os.path.exists(cfg_path) and os.remove(cfg_path))
+
+        # Pre-populate the map so we can verify it gets refreshed
+        REVERSE_DNS_MAP.clear()
+        REVERSE_DNS_MAP["stale.example.com"] = {
+            "name": "Stale",
+            "type": "stale",
+        }
+        original_contents = dict(REVERSE_DNS_MAP)
+
+        with patch.object(sys, "argv", ["parsedmarc", "-c", cfg_path]):
+            with self.assertRaises(SystemExit):
+                parsedmarc.cli._main()
+
+        self.assertEqual(mock_watch.call_count, 2)
+        # The map should have been repopulated (not empty, not the stale data)
+        self.assertEqual(len(map_snapshots), 1)
+        refreshed = map_snapshots[0]
+        self.assertGreater(len(refreshed), 0, "Map should not be empty after reload")
+        self.assertNotEqual(
+            refreshed,
+            original_contents,
+            "Map should have been refreshed, not kept stale data",
+        )
+        self.assertNotIn(
+            "stale.example.com",
+            refreshed,
+            "Stale entry should have been cleared by reload",
+        )
+
+
+class TestIndexPrefixDomainMapTlsFiltering(unittest.TestCase):
+    """Tests that SMTP TLS reports for unmapped domains are filtered out
+    when index_prefix_domain_map is configured."""
+
+    @patch("parsedmarc.cli.get_dmarc_reports_from_mailbox")
+    @patch("parsedmarc.cli.IMAPConnection")
+    def testTlsReportsFilteredByDomainMap(
+        self,
+        mock_imap_connection,
+        mock_get_reports,
+    ):
+        """TLS reports for domains not in the map should be silently dropped."""
+        mock_imap_connection.return_value = object()
+        mock_get_reports.return_value = {
+            "aggregate_reports": [],
+            "failure_reports": [],
+            "smtp_tls_reports": [
+                {
+                    "organization_name": "Allowed Org",
+                    "begin_date": "2024-01-01T00:00:00Z",
+                    "end_date": "2024-01-01T23:59:59Z",
+                    "report_id": "allowed-1",
+                    "contact_info": "tls@allowed.example.com",
+                    "policies": [
+                        {
+                            "policy_domain": "allowed.example.com",
+                            "policy_type": "sts",
+                            "successful_session_count": 1,
+                            "failed_session_count": 0,
+                        }
+                    ],
+                },
+                {
+                    "organization_name": "Unmapped Org",
+                    "begin_date": "2024-01-01T00:00:00Z",
+                    "end_date": "2024-01-01T23:59:59Z",
+                    "report_id": "unmapped-1",
+                    "contact_info": "tls@unmapped.example.net",
+                    "policies": [
+                        {
+                            "policy_domain": "unmapped.example.net",
+                            "policy_type": "sts",
+                            "successful_session_count": 5,
+                            "failed_session_count": 0,
+                        }
+                    ],
+                },
+                {
+                    "organization_name": "Mixed Case Org",
+                    "begin_date": "2024-01-01T00:00:00Z",
+                    "end_date": "2024-01-01T23:59:59Z",
+                    "report_id": "mixed-case-1",
+                    "contact_info": "tls@mixedcase.example.com",
+                    "policies": [
+                        {
+                            "policy_domain": "MixedCase.Example.Com",
+                            "policy_type": "sts",
+                            "successful_session_count": 2,
+                            "failed_session_count": 0,
+                        }
+                    ],
+                },
+            ],
+        }
+
+        domain_map = {"tenant_a": ["example.com"]}
+        with NamedTemporaryFile("w", suffix=".yaml", delete=False) as map_file:
+            import yaml
+
+            yaml.dump(domain_map, map_file)
+            map_path = map_file.name
+        self.addCleanup(lambda: os.path.exists(map_path) and os.remove(map_path))
+
+        config = f"""[general]
+save_smtp_tls = true
+silent = false
+index_prefix_domain_map = {map_path}
+
+[imap]
+host = imap.example.com
+user = test-user
+password = test-password
+"""
+        with NamedTemporaryFile("w", suffix=".ini", delete=False) as config_file:
+            config_file.write(config)
+            config_path = config_file.name
+        self.addCleanup(lambda: os.path.exists(config_path) and os.remove(config_path))
+
+        captured = io.StringIO()
+        with patch.object(sys, "argv", ["parsedmarc", "-c", config_path]):
+            with patch("sys.stdout", captured):
+                parsedmarc.cli._main()
+
+        output = json.loads(captured.getvalue())
+        tls_reports = output["smtp_tls_reports"]
+        self.assertEqual(len(tls_reports), 2)
+        report_ids = {r["report_id"] for r in tls_reports}
+        self.assertIn("allowed-1", report_ids)
+        self.assertIn("mixed-case-1", report_ids)
+        self.assertNotIn("unmapped-1", report_ids)
+
+
+class TestConfigAliases(unittest.TestCase):
+    """Tests for config key aliases (short names suited to env vars)."""
+
+    def test_maildir_create_alias(self):
+        """[maildir] create works as alias for maildir_create."""
+        from argparse import Namespace
+        from parsedmarc.cli import _load_config, _parse_config
+
+        env = {
+            "PARSEDMARC_MAILDIR_CREATE": "true",
+            "PARSEDMARC_MAILDIR_PATH": "/tmp/test",
+        }
+        with patch.dict(os.environ, env, clear=False):
+            config = _load_config(None)
+        opts = Namespace()
+        _parse_config(config, opts)
+        self.assertTrue(opts.maildir_create)
+
+    def test_maildir_path_alias(self):
+        """[maildir] path works as alias for maildir_path."""
+        from argparse import Namespace
+        from parsedmarc.cli import _load_config, _parse_config
+
+        env = {"PARSEDMARC_MAILDIR_PATH": "/var/mail/dmarc"}
+        with patch.dict(os.environ, env, clear=False):
+            config = _load_config(None)
+        opts = Namespace()
+        _parse_config(config, opts)
+        self.assertEqual(opts.maildir_path, "/var/mail/dmarc")
+
+    def test_msgraph_url_alias(self):
+        """[msgraph] url works as alias for graph_url."""
+        from parsedmarc.cli import _load_config, _parse_config
+        from argparse import Namespace
+
+        env = {
+            "PARSEDMARC_MSGRAPH_AUTH_METHOD": "ClientSecret",
+            "PARSEDMARC_MSGRAPH_CLIENT_ID": "test-id",
+            "PARSEDMARC_MSGRAPH_CLIENT_SECRET": "test-secret",
+            "PARSEDMARC_MSGRAPH_TENANT_ID": "test-tenant",
+            "PARSEDMARC_MSGRAPH_MAILBOX": "test@example.com",
+            "PARSEDMARC_MSGRAPH_URL": "https://custom.graph.example.com",
+        }
+        with patch.dict(os.environ, env, clear=False):
+            config = _load_config(None)
+        opts = Namespace()
+        _parse_config(config, opts)
+        self.assertEqual(opts.graph_url, "https://custom.graph.example.com")
+
+    def test_original_keys_still_work(self):
+        """Original INI key names (maildir_create, maildir_path) still work."""
+        from argparse import Namespace
+        from parsedmarc.cli import _parse_config
+
+        config = ConfigParser(interpolation=None)
+        config.add_section("maildir")
+        config.set("maildir", "maildir_path", "/original/path")
+        config.set("maildir", "maildir_create", "true")
+
+        opts = Namespace()
+        _parse_config(config, opts)
+        self.assertEqual(opts.maildir_path, "/original/path")
+        self.assertTrue(opts.maildir_create)
+
+    def test_ipinfo_url_option(self):
+        """[general] ipinfo_url lands on opts.ipinfo_url."""
+        from argparse import Namespace
+        from parsedmarc.cli import _parse_config
+
+        config = ConfigParser(interpolation=None)
+        config.add_section("general")
+        config.set("general", "ipinfo_url", "https://mirror.example/mmdb")
+
+        opts = Namespace()
+        _parse_config(config, opts)
+        self.assertEqual(opts.ipinfo_url, "https://mirror.example/mmdb")
+
+    def test_ip_db_url_deprecated_alias(self):
+        """[general] ip_db_url is accepted as an alias for ipinfo_url but
+        emits a deprecation warning."""
+        from argparse import Namespace
+        from parsedmarc.cli import _parse_config
+
+        config = ConfigParser(interpolation=None)
+        config.add_section("general")
+        config.set("general", "ip_db_url", "https://old.example/mmdb")
+
+        opts = Namespace()
+        with self.assertLogs("parsedmarc.log", level="WARNING") as cm:
+            _parse_config(config, opts)
+        self.assertEqual(opts.ipinfo_url, "https://old.example/mmdb")
+        self.assertTrue(
+            any("ip_db_url" in line and "deprecated" in line for line in cm.output),
+            f"expected deprecation warning, got: {cm.output}",
+        )
+
+
+class TestExpandPath(unittest.TestCase):
+    """Tests for _expand_path config path expansion."""
+
+    def test_expand_tilde(self):
+        from parsedmarc.cli import _expand_path
+
+        result = _expand_path("~/some/path")
+        self.assertFalse(result.startswith("~"))
+        self.assertTrue(result.endswith("/some/path"))
+
+    def test_expand_env_var(self):
+        from parsedmarc.cli import _expand_path
+
+        with patch.dict(os.environ, {"PARSEDMARC_TEST_DIR": "/opt/data"}):
+            result = _expand_path("$PARSEDMARC_TEST_DIR/tokens/.token")
+        self.assertEqual(result, "/opt/data/tokens/.token")
+
+    def test_expand_both(self):
+        from parsedmarc.cli import _expand_path
+
+        with patch.dict(os.environ, {"MY_APP": "parsedmarc"}):
+            result = _expand_path("~/$MY_APP/config")
+        self.assertNotIn("~", result)
+        self.assertIn("parsedmarc/config", result)
+
+    def test_no_expansion_needed(self):
+        from parsedmarc.cli import _expand_path
+
+        self.assertEqual(_expand_path("/absolute/path"), "/absolute/path")
+        self.assertEqual(_expand_path("relative/path"), "relative/path")
+
+
+if __name__ == "__main__":
+    unittest.main(verbosity=2)
diff --git a/tests/test_gelf.py b/tests/test_gelf.py
new file mode 100644
index 0000000..64ed4e6
--- /dev/null
+++ b/tests/test_gelf.py
@@ -0,0 +1,23 @@
+"""Tests for parsedmarc.gelf"""
+
+import unittest
+
+
+class Test(unittest.TestCase):
+    """Kitchen-sink tests redistributed from the original
+    tests.py monolith. Future PRs should split these further
+    into purpose-specific TestCase subclasses as natural
+    groupings emerge."""
+
+    def testGelfBackwardCompatAlias(self):
+        """GelfClient forensic alias points to failure method"""
+        from parsedmarc.gelf import GelfClient
+
+        self.assertIs(
+            GelfClient.save_forensic_report_to_gelf,  # type: ignore[attr-defined]
+            GelfClient.save_failure_report_to_gelf,
+        )
+
+
+if __name__ == "__main__":
+    unittest.main(verbosity=2)
diff --git a/tests/test_init.py b/tests/test_init.py
new file mode 100644
index 0000000..98cdb8a
--- /dev/null
+++ b/tests/test_init.py
@@ -0,0 +1,2310 @@
+"""Tests for the top-level parsedmarc package (parsedmarc/__init__.py).
+
+Covers the public parsing surface: parse_report_file, parse_report_email,
+parse_aggregate_report_xml, parse_failure_report, parse_smtp_tls_report_json,
+extract_report, get_dmarc_reports_from_mbox, and the CSV / JSON renderers.
+"""
+
+import json
+import os
+import unittest
+from datetime import datetime, timedelta, timezone
+from glob import glob
+from io import BytesIO
+from pathlib import Path
+from tempfile import NamedTemporaryFile
+from typing import BinaryIO, cast
+
+from lxml import etree  # type: ignore[import-untyped]
+
+import parsedmarc
+from parsedmarc.types import AggregateReport, FailureReport, SMTPTLSReport
+
+# Detect if running in GitHub Actions to skip DNS lookups
+OFFLINE_MODE = os.environ.get("GITHUB_ACTIONS", "false").lower() == "true"
+
+
+def minify_xml(xml_string):
+    parser = etree.XMLParser(remove_blank_text=True)
+    tree = etree.fromstring(xml_string.encode("utf-8"), parser)
+    return etree.tostring(tree, pretty_print=False).decode("utf-8")
+
+
+def compare_xml(xml1, xml2):
+    parser = etree.XMLParser(remove_blank_text=True)
+    tree1 = etree.fromstring(xml1.encode("utf-8"), parser)
+    tree2 = etree.fromstring(xml2.encode("utf-8"), parser)
+    return etree.tostring(tree1) == etree.tostring(tree2)
+
+
+class Test(unittest.TestCase):
+    """Kitchen-sink tests redistributed from the original
+    tests.py monolith. Future PRs should split these further
+    into purpose-specific TestCase subclasses as natural
+    groupings emerge."""
+
+    def testExtractReportXMLComparator(self):
+        """Test XML comparator function"""
+        with open("samples/extract_report/nice-input.xml") as f:
+            xmlnice = f.read()
+        with open("samples/extract_report/changed-input.xml") as f:
+            xmlchanged = minify_xml(f.read())
+        self.assertTrue(compare_xml(xmlnice, xmlnice))
+        self.assertTrue(compare_xml(xmlchanged, xmlchanged))
+        self.assertFalse(compare_xml(xmlnice, xmlchanged))
+        self.assertFalse(compare_xml(xmlchanged, xmlnice))
+        print("Passed!")
+
+    def testExtractReportBytes(self):
+        """Test extract report function for bytes string input"""
+        print()
+        file = "samples/extract_report/nice-input.xml"
+        with open(file, "rb") as f:
+            data = f.read()
+        print("Testing {0}: ".format(file), end="")
+        xmlout = parsedmarc.extract_report(data)
+        with open("samples/extract_report/nice-input.xml") as f:
+            xmlin = f.read()
+        self.assertTrue(compare_xml(xmlout, xmlin))
+        print("Passed!")
+
+    def testExtractReportXML(self):
+        """Test extract report function for XML input"""
+        print()
+        file = "samples/extract_report/nice-input.xml"
+        print("Testing {0}: ".format(file), end="")
+        xmlout = parsedmarc.extract_report_from_file_path(file)
+        with open("samples/extract_report/nice-input.xml") as f:
+            xmlin = f.read()
+        self.assertTrue(compare_xml(xmlout, xmlin))
+        print("Passed!")
+
+    def testExtractReportXMLFromPath(self):
+        """Test extract report function for pathlib.Path input"""
+        report_path = Path("samples/extract_report/nice-input.xml")
+        xmlout = parsedmarc.extract_report_from_file_path(report_path)
+        with open("samples/extract_report/nice-input.xml") as xmlin_file:
+            xmlin = xmlin_file.read()
+        self.assertTrue(compare_xml(xmlout, xmlin))
+
+    def testExtractReportGZip(self):
+        """Test extract report function for gzip input"""
+        print()
+        file = "samples/extract_report/nice-input.xml.gz"
+        print("Testing {0}: ".format(file), end="")
+        xmlout = parsedmarc.extract_report_from_file_path(file)
+        with open("samples/extract_report/nice-input.xml") as f:
+            xmlin = f.read()
+        self.assertTrue(compare_xml(xmlout, xmlin))
+        print("Passed!")
+
+    def testExtractReportZip(self):
+        """Test extract report function for zip input"""
+        print()
+        file = "samples/extract_report/nice-input.xml.zip"
+        print("Testing {0}: ".format(file), end="")
+        xmlout = parsedmarc.extract_report_from_file_path(file)
+        with open("samples/extract_report/nice-input.xml") as f:
+            xmlin = minify_xml(f.read())
+        self.assertTrue(compare_xml(xmlout, xmlin))
+        with open("samples/extract_report/changed-input.xml") as f:
+            xmlin = f.read()
+        self.assertFalse(compare_xml(xmlout, xmlin))
+        print("Passed!")
+
+    def testParseReportFileAcceptsPathForXML(self):
+        report_path = Path(
+            "samples/aggregate/protection.outlook.com!example.com!1711756800!1711843200.xml"
+        )
+        result = parsedmarc.parse_report_file(
+            report_path,
+            offline=True,
+        )
+        assert result["report_type"] == "aggregate"
+        report = cast(AggregateReport, result["report"])
+        self.assertEqual(report["report_metadata"]["org_name"], "outlook.com")
+
+    def testParseReportFileAcceptsPathForEmail(self):
+        report_path = Path(
+            "samples/aggregate/Report domain- borschow.com Submitter- google.com Report-ID- 949348866075514174.eml"
+        )
+        result = parsedmarc.parse_report_file(
+            report_path,
+            offline=True,
+        )
+        assert result["report_type"] == "aggregate"
+        report = cast(AggregateReport, result["report"])
+        self.assertEqual(report["report_metadata"]["org_name"], "google.com")
+
+    def testAggregateSamples(self):
+        """Test sample aggregate/rua DMARC reports"""
+        print()
+        sample_paths = glob("samples/aggregate/*")
+        for sample_path in sample_paths:
+            if os.path.isdir(sample_path):
+                continue
+            print("Testing {0}: ".format(sample_path), end="")
+            with self.subTest(sample=sample_path):
+                result = parsedmarc.parse_report_file(
+                    sample_path, always_use_local_files=True, offline=OFFLINE_MODE
+                )
+                assert result["report_type"] == "aggregate"
+                parsedmarc.parsed_aggregate_reports_to_csv(
+                    cast(AggregateReport, result["report"])
+                )
+            print("Passed!")
+
+    def testEmptySample(self):
+        """Test empty/unparsable report"""
+        with self.assertRaises(parsedmarc.ParserError):
+            parsedmarc.parse_report_file("samples/empty.xml", offline=OFFLINE_MODE)
+
+    def testFailureSamples(self):
+        """Test sample failure/ruf DMARC reports"""
+        print()
+        sample_paths = glob("samples/failure/*.eml")
+        for sample_path in sample_paths:
+            print("Testing {0}: ".format(sample_path), end="")
+            with self.subTest(sample=sample_path):
+                with open(sample_path) as sample_file:
+                    sample_content = sample_file.read()
+                    email_result = parsedmarc.parse_report_email(
+                        sample_content, offline=OFFLINE_MODE
+                    )
+                    assert email_result["report_type"] == "failure"
+                result = parsedmarc.parse_report_file(sample_path, offline=OFFLINE_MODE)
+                assert result["report_type"] == "failure"
+                parsedmarc.parsed_failure_reports_to_csv(
+                    cast(FailureReport, result["report"])
+                )
+            print("Passed!")
+
+    def testFailureReportBackwardCompat(self):
+        """Test that old forensic function aliases still work"""
+        self.assertIs(
+            parsedmarc.parse_forensic_report,
+            parsedmarc.parse_failure_report,
+        )
+        self.assertIs(
+            parsedmarc.parsed_forensic_reports_to_csv,
+            parsedmarc.parsed_failure_reports_to_csv,
+        )
+        self.assertIs(
+            parsedmarc.parsed_forensic_reports_to_csv_rows,
+            parsedmarc.parsed_failure_reports_to_csv_rows,
+        )
+        self.assertIs(
+            parsedmarc.InvalidForensicReport,
+            parsedmarc.InvalidFailureReport,
+        )
+
+    def testRFC9990SampleReport(self):
+        """Test parsing the sample report from RFC 9990 Appendix B"""
+        print()
+        sample_path = "samples/aggregate/rfc9990-sample.xml"
+        print("Testing {0}: ".format(sample_path), end="")
+        result = parsedmarc.parse_report_file(
+            sample_path, always_use_local_files=True, offline=True
+        )
+        report = cast(AggregateReport, result["report"])
+
+        # Verify report_type
+        self.assertEqual(result["report_type"], "aggregate")
+
+        # Verify xml_schema
+        self.assertEqual(report["xml_schema"], "1.0")
+
+        # Verify report_metadata
+        metadata = report["report_metadata"]
+        self.assertEqual(metadata["org_name"], "Sample Reporter")
+        self.assertEqual(metadata["org_email"], "report_sender@example-reporter.com")
+        self.assertEqual(metadata["org_extra_contact_info"], "...")
+        self.assertEqual(metadata["report_id"], "3v98abbp8ya9n3va8yr8oa3ya")
+        self.assertEqual(
+            metadata["generator"],
+            "Example DMARC Aggregate Reporter v1.2",
+        )
+
+        # Verify RFC 9990 policy_published fields
+        pp = report["policy_published"]
+        self.assertEqual(pp["domain"], "example.com")
+        self.assertEqual(pp["p"], "quarantine")
+        self.assertEqual(pp["sp"], "none")
+        self.assertEqual(pp["np"], "none")
+        self.assertEqual(pp["testing"], "n")
+        self.assertEqual(pp["discovery_method"], "treewalk")
+        # adkim/aspf default when not in XML
+        self.assertEqual(pp["adkim"], "r")
+        self.assertEqual(pp["aspf"], "r")
+        # pct is removed in RFC 9989 (and so absent from the RFC 9990
+        # sample); fo is still part of RFC 9990's PolicyPublishedType but
+        # the appendix sample happens not to set it.
+        self.assertIsNone(pp["pct"])
+        self.assertIsNone(pp["fo"])
+
+        # Verify record
+        self.assertEqual(len(report["records"]), 1)
+        rec = report["records"][0]
+        self.assertEqual(rec["source"]["ip_address"], "192.0.2.123")
+        self.assertEqual(rec["count"], 123)
+        self.assertEqual(rec["policy_evaluated"]["disposition"], "pass")
+        self.assertEqual(rec["policy_evaluated"]["dkim"], "pass")
+        self.assertEqual(rec["policy_evaluated"]["spf"], "fail")
+
+        # Verify DKIM auth result (human_result is None for this sample)
+        self.assertEqual(len(rec["auth_results"]["dkim"]), 1)
+        dkim = rec["auth_results"]["dkim"][0]
+        self.assertEqual(dkim["domain"], "example.com")
+        self.assertEqual(dkim["selector"], "abc123")
+        self.assertEqual(dkim["result"], "pass")
+        self.assertIsNone(dkim["human_result"])
+
+        # Verify SPF auth result (human_result is None for this sample)
+        self.assertEqual(len(rec["auth_results"]["spf"]), 1)
+        spf = rec["auth_results"]["spf"][0]
+        self.assertEqual(spf["domain"], "example.com")
+        self.assertEqual(spf["result"], "fail")
+        self.assertIsNone(spf["human_result"])
+
+        # Verify CSV output includes new fields
+        csv = parsedmarc.parsed_aggregate_reports_to_csv(report)
+        header = csv.split("\n")[0]
+        self.assertIn("np", header.split(","))
+        self.assertIn("testing", header.split(","))
+        self.assertIn("discovery_method", header.split(","))
+        print("Passed!")
+
+    def testRFC9990FieldsAbsentFromRFC7489Report(self):
+        """Test that RFC 7489 reports have None for RFC 9990-only fields"""
+        print()
+        sample_path = (
+            "samples/aggregate/example.net!example.com!1529366400!1529452799.xml"
+        )
+        print("Testing {0}: ".format(sample_path), end="")
+        result = parsedmarc.parse_report_file(
+            sample_path, always_use_local_files=True, offline=True
+        )
+        report = cast(AggregateReport, result["report"])
+        pp = report["policy_published"]
+
+        # RFC 7489 fields present
+        self.assertEqual(pp["pct"], "100")
+        self.assertEqual(pp["fo"], "0")
+
+        # RFC 9990-only fields absent (None)
+        self.assertIsNone(pp["np"])
+        self.assertIsNone(pp["testing"])
+        self.assertIsNone(pp["discovery_method"])
+
+        # generator absent (None)
+        self.assertIsNone(report["report_metadata"]["generator"])
+        print("Passed!")
+
+    def testRFC9990WithExplicitFields(self):
+        """Test RFC 9990 report with explicit testing and discovery_method"""
+        print()
+        sample_path = (
+            "samples/aggregate/"
+            "rfc9990-example.net!example.com!1700000000!1700086399.xml"
+        )
+        print("Testing {0}: ".format(sample_path), end="")
+        result = parsedmarc.parse_report_file(
+            sample_path, always_use_local_files=True, offline=True
+        )
+        report = cast(AggregateReport, result["report"])
+        pp = report["policy_published"]
+
+        self.assertEqual(pp["np"], "reject")
+        self.assertEqual(pp["testing"], "y")
+        self.assertEqual(pp["discovery_method"], "treewalk")
+        print("Passed!")
+
+    def testRFC9990NamespaceCaptured(self):
+        """The dmarc-2.0 namespace on <feedback> is preserved on the
+        parsed report so consumers can distinguish RFC 9990 from RFC 7489
+        reports without inferring from the version element value."""
+        result = parsedmarc.parse_report_file(
+            "samples/aggregate/rfc9990-sample.xml",
+            always_use_local_files=True,
+            offline=True,
+        )
+        report = cast(AggregateReport, result["report"])
+        self.assertEqual(
+            report["xml_namespace"],
+            "urn:ietf:params:xml:ns:dmarc-2.0",
+        )
+
+    def testRFC9990NamespaceAbsentOnRFC7489Report(self):
+        """RFC 7489 reports don't declare the dmarc-2.0 namespace, so
+        xml_namespace is None."""
+        result = parsedmarc.parse_report_file(
+            "samples/aggregate/example.net!example.com!1529366400!1529452799.xml",
+            always_use_local_files=True,
+            offline=True,
+        )
+        report = cast(AggregateReport, result["report"])
+        self.assertIsNone(report["xml_namespace"])
+
+    def testRFC9990DetectionAcceptsNamespacelessReports(self):
+        """A report that follows the RFC 9990 shape without declaring the
+        namespace (e.g. emits np/testing/discovery_method) is still
+        treated as RFC 9990 for validation purposes — warnings fire,
+        the namespace field reports it honestly as absent."""
+        with self.assertLogs("parsedmarc.log", level="WARNING") as cm:
+            report = parsedmarc.parse_aggregate_report_xml(
+                """<?xml version="1.0"?>
+                <feedback>
+                    <report_metadata>
+                        <org_name>Test</org_name>
+                        <email>t@example.com</email>
+                        <report_id>r1</report_id>
+                        <date_range><begin>1700000000</begin><end>1700086399</end></date_range>
+                    </report_metadata>
+                    <policy_published>
+                        <domain>example.com</domain>
+                        <p>none</p>
+                        <np>reject</np>
+                    </policy_published>
+                    <record>
+                        <row>
+                            <source_ip>192.0.2.1</source_ip>
+                            <count>1</count>
+                            <policy_evaluated>
+                                <disposition>none</disposition>
+                                <dkim>pass</dkim>
+                                <spf>pass</spf>
+                            </policy_evaluated>
+                        </row>
+                        <identifiers><header_from>example.com</header_from></identifiers>
+                        <auth_results>
+                            <dkim>
+                                <domain>example.com</domain>
+                                <result>pass</result>
+                            </dkim>
+                        </auth_results>
+                    </record>
+                </feedback>""",
+                offline=True,
+            )
+        # Namespace honestly None because none was declared.
+        self.assertIsNone(report["xml_namespace"])
+        # RFC 9990 detection still fired (DKIM selector warning emitted).
+        self.assertTrue(
+            any("selector" in msg for msg in cm.output),
+            f"Expected DKIM selector warning; got: {cm.output}",
+        )
+
+    def testRFC9990DKIMMissingSelectorWarning(self):
+        """A DKIM auth result with no <selector> in an RFC 9990 report
+        (namespace declared) emits a warning since selector is REQUIRED."""
+        xml = """<?xml version="1.0"?>
+        <feedback xmlns="urn:ietf:params:xml:ns:dmarc-2.0">
+            <version>1.0</version>
+            <report_metadata>
+                <org_name>Test</org_name>
+                <email>t@example.com</email>
+                <report_id>r1</report_id>
+                <date_range><begin>1700000000</begin><end>1700086399</end></date_range>
+            </report_metadata>
+            <policy_published>
+                <domain>example.com</domain>
+                <p>none</p>
+            </policy_published>
+            <record>
+                <row>
+                    <source_ip>192.0.2.1</source_ip>
+                    <count>1</count>
+                    <policy_evaluated>
+                        <disposition>none</disposition>
+                        <dkim>pass</dkim>
+                        <spf>pass</spf>
+                    </policy_evaluated>
+                </row>
+                <identifiers><header_from>example.com</header_from></identifiers>
+                <auth_results>
+                    <dkim>
+                        <domain>example.com</domain>
+                        <result>pass</result>
+                    </dkim>
+                </auth_results>
+            </record>
+        </feedback>"""
+        with self.assertLogs("parsedmarc.log", level="WARNING") as cm:
+            parsedmarc.parse_aggregate_report_xml(xml, offline=True)
+        self.assertTrue(
+            any("selector" in m and "REQUIRED" in m for m in cm.output),
+            f"Expected selector REQUIRED warning; got: {cm.output}",
+        )
+
+    def testRFC9990LegacyOverrideTypeWarning(self):
+        """`forwarded` and `sampled_out` were removed in RFC 9990;
+        a warning fires when they appear in an RFC 9990 report."""
+        xml = """<?xml version="1.0"?>
+        <feedback xmlns="urn:ietf:params:xml:ns:dmarc-2.0">
+            <report_metadata>
+                <org_name>Test</org_name>
+                <email>t@example.com</email>
+                <report_id>r1</report_id>
+                <date_range><begin>1700000000</begin><end>1700086399</end></date_range>
+            </report_metadata>
+            <policy_published>
+                <domain>example.com</domain>
+                <p>none</p>
+            </policy_published>
+            <record>
+                <row>
+                    <source_ip>192.0.2.1</source_ip>
+                    <count>1</count>
+                    <policy_evaluated>
+                        <disposition>none</disposition>
+                        <dkim>pass</dkim>
+                        <spf>pass</spf>
+                        <reason><type>forwarded</type></reason>
+                    </policy_evaluated>
+                </row>
+                <identifiers><header_from>example.com</header_from></identifiers>
+                <auth_results>
+                    <dkim>
+                        <domain>example.com</domain>
+                        <selector>s</selector>
+                        <result>pass</result>
+                    </dkim>
+                </auth_results>
+            </record>
+        </feedback>"""
+        with self.assertLogs("parsedmarc.log", level="WARNING") as cm:
+            parsedmarc.parse_aggregate_report_xml(xml, offline=True)
+        self.assertTrue(
+            any("forwarded" in m and "removed in RFC 9990" in m for m in cm.output),
+            f"Expected legacy override warning; got: {cm.output}",
+        )
+
+    def testRFC9990LangAttrStringUnwrapped(self):
+        """When a langAttrString element (extra_contact_info, error,
+        comment, human_result) carries a lang attribute, xmltodict turns
+        it into {"#text": "...", "@lang": "en"}; the parser must unwrap
+        to the text payload so the report stays comparable to one
+        without the lang attribute."""
+        xml = """<?xml version="1.0"?>
+        <feedback xmlns="urn:ietf:params:xml:ns:dmarc-2.0">
+            <report_metadata>
+                <org_name>Test</org_name>
+                <email>t@example.com</email>
+                <extra_contact_info xml:lang="en">contact-here</extra_contact_info>
+                <report_id>r1</report_id>
+                <date_range><begin>1700000000</begin><end>1700086399</end></date_range>
+                <error xml:lang="en">a problem</error>
+            </report_metadata>
+            <policy_published>
+                <domain>example.com</domain>
+                <p>none</p>
+            </policy_published>
+            <record>
+                <row>
+                    <source_ip>192.0.2.1</source_ip>
+                    <count>1</count>
+                    <policy_evaluated>
+                        <disposition>none</disposition>
+                        <dkim>pass</dkim>
+                        <spf>pass</spf>
+                        <reason>
+                            <type>local_policy</type>
+                            <comment xml:lang="en">a comment</comment>
+                        </reason>
+                    </policy_evaluated>
+                </row>
+                <identifiers><header_from>example.com</header_from></identifiers>
+                <auth_results>
+                    <dkim>
+                        <domain>example.com</domain>
+                        <selector>s</selector>
+                        <result>pass</result>
+                        <human_result xml:lang="en">looks fine</human_result>
+                    </dkim>
+                    <spf>
+                        <domain>example.com</domain>
+                        <result>pass</result>
+                        <human_result xml:lang="en">spf-detail</human_result>
+                    </spf>
+                </auth_results>
+            </record>
+        </feedback>"""
+        report = parsedmarc.parse_aggregate_report_xml(xml, offline=True)
+        self.assertEqual(
+            report["report_metadata"]["org_extra_contact_info"], "contact-here"
+        )
+        self.assertEqual(report["report_metadata"]["errors"], ["a problem"])
+        rec = report["records"][0]
+        reasons = rec["policy_evaluated"]["policy_override_reasons"]
+        self.assertEqual(reasons[0]["comment"], "a comment")
+        self.assertEqual(rec["auth_results"]["dkim"][0]["human_result"], "looks fine")
+        self.assertEqual(rec["auth_results"]["spf"][0]["human_result"], "spf-detail")
+
+    def testSmtpTlsSamples(self):
+        """Test sample SMTP TLS reports"""
+        print()
+        sample_paths = glob("samples/smtp_tls/*")
+        for sample_path in sample_paths:
+            if os.path.isdir(sample_path):
+                continue
+            print("Testing {0}: ".format(sample_path), end="")
+            with self.subTest(sample=sample_path):
+                result = parsedmarc.parse_report_file(sample_path, offline=OFFLINE_MODE)
+                assert result["report_type"] == "smtp_tls"
+                parsedmarc.parsed_smtp_tls_reports_to_csv(
+                    cast(SMTPTLSReport, result["report"])
+                )
+            print("Passed!")
+
+    def testAggregateCsvExposesASNColumns(self):
+        """The aggregate CSV output should include source_asn, source_as_name,
+        and source_as_domain columns."""
+        result = parsedmarc.parse_report_file(
+            "samples/aggregate/!example.com!1538204542!1538463818.xml",
+            always_use_local_files=True,
+            offline=True,
+        )
+        csv_text = parsedmarc.parsed_aggregate_reports_to_csv(result["report"])
+        header = csv_text.splitlines()[0].split(",")
+        self.assertIn("source_asn", header)
+        self.assertIn("source_as_name", header)
+        self.assertIn("source_as_domain", header)
+
+    def testBucketIntervalBeginAfterEnd(self):
+        """begin > end should raise ValueError"""
+        begin = datetime(2024, 1, 2, tzinfo=timezone.utc)
+        end = datetime(2024, 1, 1, tzinfo=timezone.utc)
+        with self.assertRaises(ValueError):
+            parsedmarc._bucket_interval_by_day(begin, end, 100)
+
+    def testBucketIntervalNaiveDatetime(self):
+        """Non-timezone-aware datetimes should raise ValueError"""
+        begin = datetime(2024, 1, 1)
+        end = datetime(2024, 1, 2)
+        with self.assertRaises(ValueError):
+            parsedmarc._bucket_interval_by_day(begin, end, 100)
+
+    def testBucketIntervalDifferentTzinfo(self):
+        """Different tzinfo objects should raise ValueError"""
+        tz1 = timezone.utc
+        tz2 = timezone(timedelta(hours=5))
+        begin = datetime(2024, 1, 1, tzinfo=tz1)
+        end = datetime(2024, 1, 2, tzinfo=tz2)
+        with self.assertRaises(ValueError):
+            parsedmarc._bucket_interval_by_day(begin, end, 100)
+
+    def testBucketIntervalNegativeCount(self):
+        """Negative total_count should raise ValueError"""
+        begin = datetime(2024, 1, 1, tzinfo=timezone.utc)
+        end = datetime(2024, 1, 2, tzinfo=timezone.utc)
+        with self.assertRaises(ValueError):
+            parsedmarc._bucket_interval_by_day(begin, end, -1)
+
+    def testBucketIntervalZeroCount(self):
+        """Zero total_count should return empty list"""
+        begin = datetime(2024, 1, 1, tzinfo=timezone.utc)
+        end = datetime(2024, 1, 2, tzinfo=timezone.utc)
+        result = parsedmarc._bucket_interval_by_day(begin, end, 0)
+        self.assertEqual(result, [])
+
+    def testBucketIntervalSameBeginEnd(self):
+        """Same begin and end (zero interval) should return empty list"""
+        dt = datetime(2024, 1, 1, 12, 0, 0, tzinfo=timezone.utc)
+        result = parsedmarc._bucket_interval_by_day(dt, dt, 100)
+        self.assertEqual(result, [])
+
+    def testBucketIntervalSingleDay(self):
+        """Single day interval should return one bucket with total count"""
+        begin = datetime(2024, 1, 1, 0, 0, 0, tzinfo=timezone.utc)
+        end = datetime(2024, 1, 1, 23, 59, 59, tzinfo=timezone.utc)
+        result = parsedmarc._bucket_interval_by_day(begin, end, 100)
+        self.assertEqual(len(result), 1)
+        self.assertEqual(result[0]["count"], 100)
+        self.assertEqual(result[0]["begin"], begin)
+
+    def testBucketIntervalMultiDay(self):
+        """Multi-day interval should distribute counts proportionally"""
+        begin = datetime(2024, 1, 1, 0, 0, 0, tzinfo=timezone.utc)
+        end = datetime(2024, 1, 3, 0, 0, 0, tzinfo=timezone.utc)
+        result = parsedmarc._bucket_interval_by_day(begin, end, 100)
+        self.assertEqual(len(result), 2)
+        total = sum(b["count"] for b in result)
+        self.assertEqual(total, 100)
+        # Equal days => equal distribution
+        self.assertEqual(result[0]["count"], 50)
+        self.assertEqual(result[1]["count"], 50)
+
+    def testBucketIntervalRemainderDistribution(self):
+        """Count not divisible by the number of days still sums to the total"""
+        begin = datetime(2024, 1, 1, 0, 0, 0, tzinfo=timezone.utc)
+        end = datetime(2024, 1, 4, 0, 0, 0, tzinfo=timezone.utc)
+        result = parsedmarc._bucket_interval_by_day(begin, end, 10)
+        total = sum(b["count"] for b in result)
+        self.assertEqual(total, 10)
+        self.assertEqual(len(result), 3)
+
+    def testBucketIntervalPartialDays(self):
+        """Partial days: 12h on day1, 24h on day2 => 1/3 vs 2/3 split"""
+        begin = datetime(2024, 1, 1, 12, 0, 0, tzinfo=timezone.utc)
+        end = datetime(2024, 1, 3, 0, 0, 0, tzinfo=timezone.utc)
+        result = parsedmarc._bucket_interval_by_day(begin, end, 90)
+        total = sum(b["count"] for b in result)
+        self.assertEqual(total, 90)
+        # day1: 12h, day2: 24h => 1/3 vs 2/3
+        self.assertEqual(result[0]["count"], 30)
+        self.assertEqual(result[1]["count"], 60)
+
+    def testAppendParsedRecordNoNormalize(self):
+        """No normalization: record appended as-is with interval fields"""
+        records = []
+        rec = {"count": 10, "source": {"ip_address": "1.2.3.4"}}
+        begin = datetime(2024, 1, 1, 0, 0, 0, tzinfo=timezone.utc)
+        end = datetime(2024, 1, 2, 0, 0, 0, tzinfo=timezone.utc)
+        parsedmarc._append_parsed_record(rec, records, begin, end, False)
+        self.assertEqual(len(records), 1)
+        self.assertFalse(records[0]["normalized_timespan"])  # type: ignore[typeddict-item]
+        self.assertEqual(records[0]["interval_begin"], "2024-01-01 00:00:00")
+        self.assertEqual(records[0]["interval_end"], "2024-01-02 00:00:00")
+
+    def testAppendParsedRecordNormalize(self):
+        """Normalization: record split into daily buckets"""
+        records = []
+        rec = {"count": 100, "source": {"ip_address": "1.2.3.4"}}
+        begin = datetime(2024, 1, 1, 0, 0, 0, tzinfo=timezone.utc)
+        end = datetime(2024, 1, 3, 0, 0, 0, tzinfo=timezone.utc)
+        parsedmarc._append_parsed_record(rec, records, begin, end, True)
+        self.assertEqual(len(records), 2)
+        total = sum(r["count"] for r in records)
+        self.assertEqual(total, 100)
+        for r in records:
+            self.assertTrue(r["normalized_timespan"])  # type: ignore[typeddict-item]
+
+    def testAppendParsedRecordNormalizeZeroCount(self):
+        """Normalization with zero count: nothing appended"""
+        records = []
+        rec = {"count": 0, "source": {"ip_address": "1.2.3.4"}}
+        begin = datetime(2024, 1, 1, 0, 0, 0, tzinfo=timezone.utc)
+        end = datetime(2024, 1, 3, 0, 0, 0, tzinfo=timezone.utc)
+        parsedmarc._append_parsed_record(rec, records, begin, end, True)
+        self.assertEqual(len(records), 0)
+
+    def testParseReportRecordNoneSourceIP(self):
+        """Record with None source_ip should raise ValueError"""
+        record = {
+            "row": {
+                "source_ip": None,
+                "count": "1",
+                "policy_evaluated": {
+                    "disposition": "none",
+                    "dkim": "pass",
+                    "spf": "pass",
+                },
+            },
+            "identifiers": {"header_from": "example.com"},
+            "auth_results": {"dkim": [], "spf": []},
+        }
+        with self.assertRaises(ValueError):
+            parsedmarc._parse_report_record(record, offline=True)
+
+    def testParseReportRecordMissingDkimSpf(self):
+        """Record with missing dkim/spf auth results defaults correctly"""
+        record = {
+            "row": {
+                "source_ip": "192.0.2.1",
+                "count": "5",
+                "policy_evaluated": {
+                    "disposition": "none",
+                    "dkim": "pass",
+                    "spf": "fail",
+                },
+            },
+            "identifiers": {"header_from": "example.com"},
+            "auth_results": {},
+        }
+        result = parsedmarc._parse_report_record(record, offline=True)
+        self.assertEqual(result["auth_results"]["dkim"], [])
+        self.assertEqual(result["auth_results"]["spf"], [])
+
+    def testParseReportRecordReasonHandling(self):
+        """Reasons in policy_evaluated get normalized with comment default"""
+        record = {
+            "row": {
+                "source_ip": "192.0.2.1",
+                "count": "1",
+                "policy_evaluated": {
+                    "disposition": "none",
+                    "dkim": "pass",
+                    "spf": "pass",
+                    "reason": {"type": "forwarded"},
+                },
+            },
+            "identifiers": {"header_from": "example.com"},
+            "auth_results": {"dkim": [], "spf": []},
+        }
+        result = parsedmarc._parse_report_record(record, offline=True)
+        reasons = result["policy_evaluated"]["policy_override_reasons"]
+        self.assertEqual(len(reasons), 1)
+        self.assertEqual(reasons[0]["type"], "forwarded")
+        self.assertIsNone(reasons[0]["comment"])
+
+    def testParseReportRecordReasonList(self):
+        """Multiple reasons as a list are preserved"""
+        record = {
+            "row": {
+                "source_ip": "192.0.2.1",
+                "count": "1",
+                "policy_evaluated": {
+                    "disposition": "none",
+                    "dkim": "pass",
+                    "spf": "pass",
+                    "reason": [
+                        {"type": "forwarded", "comment": "relay"},
+                        {"type": "local_policy"},
+                    ],
+                },
+            },
+            "identifiers": {"header_from": "example.com"},
+            "auth_results": {"dkim": [], "spf": []},
+        }
+        result = parsedmarc._parse_report_record(record, offline=True)
+        reasons = result["policy_evaluated"]["policy_override_reasons"]
+        self.assertEqual(len(reasons), 2)
+        self.assertEqual(reasons[0]["comment"], "relay")
+        self.assertIsNone(reasons[1]["comment"])
+
+    def testParseReportRecordIdentities(self):
+        """'identities' key is mapped to 'identifiers'"""
+        record = {
+            "row": {
+                "source_ip": "192.0.2.1",
+                "count": "1",
+                "policy_evaluated": {
+                    "disposition": "none",
+                    "dkim": "pass",
+                    "spf": "pass",
+                },
+            },
+            "identities": {
+                "header_from": "Example.COM",
+                "envelope_from": "example.com",
+            },
+            "auth_results": {"dkim": [], "spf": []},
+        }
+        result = parsedmarc._parse_report_record(record, offline=True)
+        self.assertIn("identifiers", result)
+        self.assertEqual(result["identifiers"]["header_from"], "example.com")
+
+    def testParseReportRecordDkimDefaults(self):
+        """DKIM result defaults: selector='none', result='none' when missing"""
+        record = {
+            "row": {
+                "source_ip": "192.0.2.1",
+                "count": "1",
+                "policy_evaluated": {
+                    "disposition": "none",
+                    "dkim": "fail",
+                    "spf": "fail",
+                },
+            },
+            "identifiers": {"header_from": "example.com"},
+            "auth_results": {
+                "dkim": {"domain": "example.com"},
+                "spf": [],
+            },
+        }
+        result = parsedmarc._parse_report_record(record, offline=True)
+        dkim = result["auth_results"]["dkim"][0]
+        self.assertEqual(dkim["selector"], "none")
+        self.assertEqual(dkim["result"], "none")
+        self.assertIsNone(dkim["human_result"])
+
+    def testParseReportRecordSpfDefaults(self):
+        """SPF result defaults: scope='mfrom', result='none' when missing"""
+        record = {
+            "row": {
+                "source_ip": "192.0.2.1",
+                "count": "1",
+                "policy_evaluated": {
+                    "disposition": "none",
+                    "dkim": "fail",
+                    "spf": "fail",
+                },
+            },
+            "identifiers": {"header_from": "example.com"},
+            "auth_results": {
+                "dkim": [],
+                "spf": {"domain": "example.com"},
+            },
+        }
+        result = parsedmarc._parse_report_record(record, offline=True)
+        spf = result["auth_results"]["spf"][0]
+        self.assertEqual(spf["scope"], "mfrom")
+        self.assertEqual(spf["result"], "none")
+        self.assertIsNone(spf["human_result"])
+
+    def testParseReportRecordHumanResult(self):
+        """human_result field is included when present"""
+        record = {
+            "row": {
+                "source_ip": "192.0.2.1",
+                "count": "1",
+                "policy_evaluated": {
+                    "disposition": "none",
+                    "dkim": "pass",
+                    "spf": "pass",
+                },
+            },
+            "identifiers": {"header_from": "example.com"},
+            "auth_results": {
+                "dkim": [
+                    {
+                        "domain": "example.com",
+                        "selector": "s1",
+                        "result": "pass",
+                        "human_result": "good key",
+                    }
+                ],
+                "spf": [
+                    {
+                        "domain": "example.com",
+                        "scope": "mfrom",
+                        "result": "pass",
+                        "human_result": "sender valid",
+                    }
+                ],
+            },
+        }
+        result = parsedmarc._parse_report_record(record, offline=True)
+        self.assertEqual(result["auth_results"]["dkim"][0]["human_result"], "good key")
+        self.assertEqual(
+            result["auth_results"]["spf"][0]["human_result"], "sender valid"
+        )
+
+    def testParseReportRecordEnvelopeFromFallback(self):
+        """envelope_from falls back to last SPF domain when missing"""
+        record = {
+            "row": {
+                "source_ip": "192.0.2.1",
+                "count": "1",
+                "policy_evaluated": {
+                    "disposition": "none",
+                    "dkim": "pass",
+                    "spf": "pass",
+                },
+            },
+            "identifiers": {"header_from": "example.com"},
+            "auth_results": {
+                "dkim": [],
+                "spf": [
+                    {"domain": "Bounce.Example.COM", "scope": "mfrom", "result": "pass"}
+                ],
+            },
+        }
+        result = parsedmarc._parse_report_record(record, offline=True)
+        self.assertEqual(result["identifiers"]["envelope_from"], "bounce.example.com")
+
+    def testParseReportRecordEnvelopeFromNullFallback(self):
+        """envelope_from None value falls back to SPF domain"""
+        record = {
+            "row": {
+                "source_ip": "192.0.2.1",
+                "count": "1",
+                "policy_evaluated": {
+                    "disposition": "none",
+                    "dkim": "pass",
+                    "spf": "pass",
+                },
+            },
+            "identifiers": {
+                "header_from": "example.com",
+                "envelope_from": None,
+            },
+            "auth_results": {
+                "dkim": [],
+                "spf": [
+                    {"domain": "SPF.Example.COM", "scope": "mfrom", "result": "pass"}
+                ],
+            },
+        }
+        result = parsedmarc._parse_report_record(record, offline=True)
+        self.assertEqual(result["identifiers"]["envelope_from"], "spf.example.com")
+
+    def testParseReportRecordEnvelopeTo(self):
+        """envelope_to is preserved and moved correctly"""
+        record = {
+            "row": {
+                "source_ip": "192.0.2.1",
+                "count": "1",
+                "policy_evaluated": {
+                    "disposition": "none",
+                    "dkim": "pass",
+                    "spf": "pass",
+                },
+            },
+            "identifiers": {
+                "header_from": "example.com",
+                "envelope_from": "bounce@example.com",
+                "envelope_to": "recipient@example.com",
+            },
+            "auth_results": {"dkim": [], "spf": []},
+        }
+        result = parsedmarc._parse_report_record(record, offline=True)
+        self.assertEqual(result["identifiers"]["envelope_to"], "recipient@example.com")
+
+    def testParseReportRecordAlignment(self):
+        """Alignment fields computed correctly from policy_evaluated"""
+        record = {
+            "row": {
+                "source_ip": "192.0.2.1",
+                "count": "1",
+                "policy_evaluated": {
+                    "disposition": "none",
+                    "dkim": "pass",
+                    "spf": "fail",
+                },
+            },
+            "identifiers": {"header_from": "example.com"},
+            "auth_results": {"dkim": [], "spf": []},
+        }
+        result = parsedmarc._parse_report_record(record, offline=True)
+        self.assertTrue(result["alignment"]["dkim"])
+        self.assertFalse(result["alignment"]["spf"])
+        self.assertTrue(result["alignment"]["dmarc"])
+
+    def testParseSmtpTlsFailureDetailsMinimal(self):
+        """Minimal failure details with just required fields"""
+        details = {
+            "result-type": "certificate-expired",
+            "failed-session-count": 5,
+        }
+        result = parsedmarc._parse_smtp_tls_failure_details(details)
+        self.assertEqual(result["result_type"], "certificate-expired")
+        self.assertEqual(result["failed_session_count"], 5)
+        self.assertNotIn("sending_mta_ip", result)
+
+    def testParseSmtpTlsFailureDetailsAllOptional(self):
+        """All optional fields included"""
+        details = {
+            "result-type": "starttls-not-supported",
+            "failed-session-count": 3,
+            "sending-mta-ip": "10.0.0.1",
+            "receiving-ip": "10.0.0.2",
+            "receiving-mx-hostname": "mx.example.com",
+            "receiving-mx-helo": "mx.example.com",
+            "additional-info-uri": "https://example.com/info",
+            "failure-reason-code": "TLS_ERROR",
+        }
+        result = parsedmarc._parse_smtp_tls_failure_details(details)
+        self.assertEqual(result["sending_mta_ip"], "10.0.0.1")
+        self.assertEqual(result["receiving_ip"], "10.0.0.2")
+        self.assertEqual(result["receiving_mx_hostname"], "mx.example.com")
+        self.assertEqual(result["receiving_mx_helo"], "mx.example.com")
+        self.assertEqual(result["additional_info_uri"], "https://example.com/info")
+        self.assertEqual(result["failure_reason_code"], "TLS_ERROR")
+
+    def testParseSmtpTlsFailureDetailsMissingRequired(self):
+        """Missing required field raises InvalidSMTPTLSReport"""
+        with self.assertRaises(parsedmarc.InvalidSMTPTLSReport):
+            parsedmarc._parse_smtp_tls_failure_details({"result-type": "err"})
+
+    def testParseSmtpTlsReportPolicyValid(self):
+        """Valid STS policy parses correctly"""
+        policy = {
+            "policy": {
+                "policy-type": "sts",
+                "policy-domain": "example.com",
+                "policy-string": ["version: STSv1", "mode: enforce"],
+                "mx-host-pattern": ["*.example.com"],
+            },
+            "summary": {
+                "total-successful-session-count": 100,
+                "total-failure-session-count": 2,
+            },
+        }
+        result = parsedmarc._parse_smtp_tls_report_policy(policy)
+        self.assertEqual(result["policy_type"], "sts")
+        self.assertEqual(result["policy_domain"], "example.com")
+        self.assertEqual(result["policy_strings"], ["version: STSv1", "mode: enforce"])
+        self.assertEqual(result["mx_host_patterns"], ["*.example.com"])
+        self.assertEqual(result["successful_session_count"], 100)
+        self.assertEqual(result["failed_session_count"], 2)
+
+    def testParseSmtpTlsReportPolicyInvalidType(self):
+        """Invalid policy type raises InvalidSMTPTLSReport"""
+        policy = {
+            "policy": {
+                "policy-type": "invalid",
+                "policy-domain": "example.com",
+            },
+            "summary": {
+                "total-successful-session-count": 0,
+                "total-failure-session-count": 0,
+            },
+        }
+        with self.assertRaises(parsedmarc.InvalidSMTPTLSReport):
+            parsedmarc._parse_smtp_tls_report_policy(policy)
+
+    def testParseSmtpTlsReportPolicyEmptyPolicyString(self):
+        """Empty policy-string list is not included"""
+        policy = {
+            "policy": {
+                "policy-type": "sts",
+                "policy-domain": "example.com",
+                "policy-string": [],
+                "mx-host-pattern": [],
+            },
+            "summary": {
+                "total-successful-session-count": 50,
+                "total-failure-session-count": 0,
+            },
+        }
+        result = parsedmarc._parse_smtp_tls_report_policy(policy)
+        self.assertNotIn("policy_strings", result)
+        self.assertNotIn("mx_host_patterns", result)
+
+    def testParseSmtpTlsReportPolicyWithFailureDetails(self):
+        """Policy with failure-details parses nested details"""
+        policy = {
+            "policy": {
+                "policy-type": "sts",
+                "policy-domain": "example.com",
+            },
+            "summary": {
+                "total-successful-session-count": 10,
+                "total-failure-session-count": 1,
+            },
+            "failure-details": [
+                {
+                    "result-type": "certificate-expired",
+                    "failed-session-count": 1,
+                }
+            ],
+        }
+        result = parsedmarc._parse_smtp_tls_report_policy(policy)
+        self.assertEqual(len(result["failure_details"]), 1)
+        self.assertEqual(
+            result["failure_details"][0]["result_type"], "certificate-expired"
+        )
+
+    def testParseSmtpTlsReportPolicyMissingField(self):
+        """Missing required policy field raises InvalidSMTPTLSReport"""
+        policy = {"policy": {"policy-type": "sts"}, "summary": {}}
+        with self.assertRaises(parsedmarc.InvalidSMTPTLSReport):
+            parsedmarc._parse_smtp_tls_report_policy(policy)
+
+    def testParseSmtpTlsReportJsonValid(self):
+        """Valid SMTP TLS JSON report parses correctly"""
+        report = json.dumps(
+            {
+                "organization-name": "Example Corp",
+                "date-range": {
+                    "start-datetime": "2024-01-01T00:00:00Z",
+                    "end-datetime": "2024-01-02T00:00:00Z",
+                },
+                "contact-info": "admin@example.com",
+                "report-id": "report-123",
+                "policies": [
+                    {
+                        "policy": {
+                            "policy-type": "sts",
+                            "policy-domain": "example.com",
+                        },
+                        "summary": {
+                            "total-successful-session-count": 50,
+                            "total-failure-session-count": 0,
+                        },
+                    }
+                ],
+            }
+        )
+        result = parsedmarc.parse_smtp_tls_report_json(report)
+        self.assertEqual(result["organization_name"], "Example Corp")
+        self.assertEqual(result["report_id"], "report-123")
+        self.assertEqual(len(result["policies"]), 1)
+
+    def testParseSmtpTlsReportJsonBytes(self):
+        """SMTP TLS report as bytes parses correctly"""
+        report = json.dumps(
+            {
+                "organization-name": "Org",
+                "date-range": {
+                    "start-datetime": "2024-01-01",
+                    "end-datetime": "2024-01-02",
+                },
+                "contact-info": "a@b.com",
+                "report-id": "r1",
+                "policies": [
+                    {
+                        "policy": {"policy-type": "tlsa", "policy-domain": "a.com"},
+                        "summary": {
+                            "total-successful-session-count": 1,
+                            "total-failure-session-count": 0,
+                        },
+                    }
+                ],
+            }
+        ).encode("utf-8")
+        result = parsedmarc.parse_smtp_tls_report_json(report)
+        self.assertEqual(result["organization_name"], "Org")
+
+    def testParseSmtpTlsReportJsonMissingField(self):
+        """Missing required field raises InvalidSMTPTLSReport"""
+        report = json.dumps({"organization-name": "Org"})
+        with self.assertRaises(parsedmarc.InvalidSMTPTLSReport):
+            parsedmarc.parse_smtp_tls_report_json(report)
+
+    def testParseSmtpTlsReportJsonPoliciesNotList(self):
+        """Non-list policies raises InvalidSMTPTLSReport"""
+        report = json.dumps(
+            {
+                "organization-name": "Org",
+                "date-range": {
+                    "start-datetime": "2024-01-01",
+                    "end-datetime": "2024-01-02",
+                },
+                "contact-info": "a@b.com",
+                "report-id": "r1",
+                "policies": "not-a-list",
+            }
+        )
+        with self.assertRaises(parsedmarc.InvalidSMTPTLSReport):
+            parsedmarc.parse_smtp_tls_report_json(report)
+
+    def testAggregateReportInvalidNpWarning(self):
+        """Invalid np/testing/discovery_method values are kept; parser only warns"""
+        xml = """<?xml version="1.0"?>
+        <feedback>
+            <version>1.0</version>
+            <report_metadata>
+                <org_name>Test Org</org_name>
+                <email>test@example.com</email>
+                <report_id>test-np-invalid</report_id>
+                <date_range><begin>1704067200</begin><end>1704153599</end></date_range>
+            </report_metadata>
+            <policy_published>
+                <domain>example.com</domain>
+                <p>none</p>
+                <np>banana</np>
+                <testing>maybe</testing>
+                <discovery_method>magic</discovery_method>
+            </policy_published>
+            <record>
+                <row>
+                    <source_ip>192.0.2.1</source_ip>
+                    <count>1</count>
+                    <policy_evaluated>
+                        <disposition>none</disposition>
+                        <dkim>pass</dkim>
+                        <spf>pass</spf>
+                    </policy_evaluated>
+                </row>
+                <identifiers><header_from>example.com</header_from></identifiers>
+                <auth_results>
+                    <spf><domain>example.com</domain><result>pass</result></spf>
+                </auth_results>
+            </record>
+        </feedback>"""
+        report = parsedmarc.parse_aggregate_report_xml(xml, offline=True)
+        # Invalid values are still stored
+        self.assertEqual(report["policy_published"]["np"], "banana")
+        self.assertEqual(report["policy_published"]["testing"], "maybe")
+        self.assertEqual(report["policy_published"]["discovery_method"], "magic")
+
+    def testAggregateReportPassDisposition(self):
+        """'pass' as valid disposition is preserved"""
+        xml = """<?xml version="1.0"?>
+        <feedback>
+            <report_metadata>
+                <org_name>TestOrg</org_name>
+                <email>test@example.com</email>
+                <report_id>test-pass</report_id>
+                <date_range><begin>1704067200</begin><end>1704153599</end></date_range>
+            </report_metadata>
+            <policy_published>
+                <domain>example.com</domain>
+                <p>reject</p>
+            </policy_published>
+            <record>
+                <row>
+                    <source_ip>192.0.2.1</source_ip>
+                    <count>1</count>
+                    <policy_evaluated>
+                        <disposition>pass</disposition>
+                        <dkim>pass</dkim>
+                        <spf>pass</spf>
+                    </policy_evaluated>
+                </row>
+                <identifiers><header_from>example.com</header_from></identifiers>
+                <auth_results>
+                    <spf><domain>example.com</domain><result>pass</result></spf>
+                </auth_results>
+            </record>
+        </feedback>"""
+        report = parsedmarc.parse_aggregate_report_xml(xml, offline=True)
+        self.assertEqual(
+            report["records"][0]["policy_evaluated"]["disposition"], "pass"
+        )
+
+    def testAggregateReportMultipleRecords(self):
+        """Reports with multiple records are all parsed"""
+        xml = """<?xml version="1.0"?>
+        <feedback>
+            <report_metadata>
+                <org_name>TestOrg</org_name>
+                <email>test@example.com</email>
+                <report_id>test-multi</report_id>
+                <date_range><begin>1704067200</begin><end>1704153599</end></date_range>
+            </report_metadata>
+            <policy_published>
+                <domain>example.com</domain>
+                <p>none</p>
+            </policy_published>
+            <record>
+                <row>
+                    <source_ip>192.0.2.1</source_ip>
+                    <count>10</count>
+                    <policy_evaluated><disposition>none</disposition><dkim>pass</dkim><spf>pass</spf></policy_evaluated>
+                </row>
+                <identifiers><header_from>example.com</header_from></identifiers>
+                <auth_results><spf><domain>example.com</domain><result>pass</result></spf></auth_results>
+            </record>
+            <record>
+                <row>
+                    <source_ip>192.0.2.2</source_ip>
+                    <count>5</count>
+                    <policy_evaluated><disposition>quarantine</disposition><dkim>fail</dkim><spf>fail</spf></policy_evaluated>
+                </row>
+                <identifiers><header_from>example.com</header_from></identifiers>
+                <auth_results><spf><domain>example.com</domain><result>fail</result></spf></auth_results>
+            </record>
+        </feedback>"""
+        report = parsedmarc.parse_aggregate_report_xml(xml, offline=True)
+        self.assertEqual(len(report["records"]), 2)
+        self.assertEqual(report["records"][0]["count"], 10)
+        self.assertEqual(report["records"][1]["count"], 5)
+
+    def testAggregateReportInvalidXmlRecovery(self):
+        """Badly formed XML is recovered via lxml"""
+        xml = '<?xml version="1.0"?><feedback><report_metadata><org_name>Test</org_name><email>t@e.com</email><report_id>r1</report_id><date_range><begin>1704067200</begin><end>1704153599</end></date_range></report_metadata><policy_published><domain>example.com</domain><p>none</p></policy_published><record><row><source_ip>192.0.2.1</source_ip><count>1</count><policy_evaluated><disposition>none</disposition><dkim>pass</dkim><spf>pass</spf></policy_evaluated></row><identifiers><header_from>example.com</header_from></identifiers><auth_results><spf><domain>example.com</domain><result>pass</result></spf></auth_results></record></feedback>'
+        report = parsedmarc.parse_aggregate_report_xml(xml, offline=True)
+        self.assertEqual(report["report_metadata"]["report_id"], "r1")
+
+    def testAggregateReportCsvRowsContainRFC9990Fields(self):
+        """CSV rows include np, testing, discovery_method columns"""
+        result = parsedmarc.parse_report_file(
+            "samples/aggregate/rfc9990-sample.xml",
+            always_use_local_files=True,
+            offline=True,
+        )
+        report = cast(AggregateReport, result["report"])
+        rows = parsedmarc.parsed_aggregate_reports_to_csv_rows(report)
+        self.assertGreater(len(rows), 0)
+        row = rows[0]
+        self.assertIn("np", row)
+        self.assertIn("testing", row)
+        self.assertIn("discovery_method", row)
+        self.assertIn("source_ip_address", row)
+        self.assertIn("dkim_domains", row)
+        self.assertIn("spf_domains", row)
+
+    def testAggregateReportSchemaVersion(self):
+        """RFC 9990 report with <version> returns correct xml_schema"""
+        xml = """<?xml version="1.0"?>
+        <feedback>
+            <version>1.0</version>
+            <report_metadata>
+                <org_name>TestOrg</org_name>
+                <email>test@example.com</email>
+                <report_id>test-version</report_id>
+                <date_range><begin>1704067200</begin><end>1704153599</end></date_range>
+            </report_metadata>
+            <policy_published>
+                <domain>example.com</domain>
+                <p>none</p>
+            </policy_published>
+            <record>
+                <row>
+                    <source_ip>192.0.2.1</source_ip>
+                    <count>1</count>
+                    <policy_evaluated><disposition>none</disposition><dkim>pass</dkim><spf>pass</spf></policy_evaluated>
+                </row>
+                <identifiers><header_from>example.com</header_from></identifiers>
+                <auth_results><spf><domain>example.com</domain><result>pass</result></spf></auth_results>
+            </record>
+        </feedback>"""
+        report = parsedmarc.parse_aggregate_report_xml(xml, offline=True)
+        self.assertEqual(report["xml_schema"], "1.0")
+
+    def testAggregateReportDraftSchema(self):
+        """Report without <version> defaults to 'draft' schema"""
+        xml = """<?xml version="1.0"?>
+        <feedback>
+            <report_metadata>
+                <org_name>TestOrg</org_name>
+                <email>test@example.com</email>
+                <report_id>test-draft</report_id>
+                <date_range><begin>1704067200</begin><end>1704153599</end></date_range>
+            </report_metadata>
+            <policy_published>
+                <domain>example.com</domain>
+                <p>none</p>
+            </policy_published>
+            <record>
+                <row>
+                    <source_ip>192.0.2.1</source_ip>
+                    <count>1</count>
+                    <policy_evaluated><disposition>none</disposition><dkim>pass</dkim><spf>pass</spf></policy_evaluated>
+                </row>
+                <identifiers><header_from>example.com</header_from></identifiers>
+                <auth_results><spf><domain>example.com</domain><result>pass</result></spf></auth_results>
+            </record>
+        </feedback>"""
+        report = parsedmarc.parse_aggregate_report_xml(xml, offline=True)
+        self.assertEqual(report["xml_schema"], "draft")
+
+    def testAggregateReportGeneratorField(self):
+        """Generator field is correctly extracted"""
+        xml = """<?xml version="1.0"?>
+        <feedback>
+            <report_metadata>
+                <org_name>TestOrg</org_name>
+                <email>test@example.com</email>
+                <report_id>test-gen</report_id>
+                <generator>My Reporter v1.0</generator>
+                <date_range><begin>1704067200</begin><end>1704153599</end></date_range>
+            </report_metadata>
+            <policy_published>
+                <domain>example.com</domain>
+                <p>none</p>
+            </policy_published>
+            <record>
+                <row>
+                    <source_ip>192.0.2.1</source_ip>
+                    <count>1</count>
+                    <policy_evaluated><disposition>none</disposition><dkim>pass</dkim><spf>pass</spf></policy_evaluated>
+                </row>
+                <identifiers><header_from>example.com</header_from></identifiers>
+                <auth_results><spf><domain>example.com</domain><result>pass</result></spf></auth_results>
+            </record>
+        </feedback>"""
+        report = parsedmarc.parse_aggregate_report_xml(xml, offline=True)
+        self.assertEqual(report["report_metadata"]["generator"], "My Reporter v1.0")
+
+    def testAggregateReportReportErrors(self):
+        """Report errors in metadata are captured"""
+        xml = """<?xml version="1.0"?>
+        <feedback>
+            <report_metadata>
+                <org_name>TestOrg</org_name>
+                <email>test@example.com</email>
+                <report_id>test-err</report_id>
+                <error>Some error</error>
+                <date_range><begin>1704067200</begin><end>1704153599</end></date_range>
+            </report_metadata>
+            <policy_published>
+                <domain>example.com</domain>
+                <p>none</p>
+            </policy_published>
+            <record>
+                <row>
+                    <source_ip>192.0.2.1</source_ip>
+                    <count>1</count>
+                    <policy_evaluated><disposition>none</disposition><dkim>pass</dkim><spf>pass</spf></policy_evaluated>
+                </row>
+                <identifiers><header_from>example.com</header_from></identifiers>
+                <auth_results><spf><domain>example.com</domain><result>pass</result></spf></auth_results>
+            </record>
+        </feedback>"""
+        report = parsedmarc.parse_aggregate_report_xml(xml, offline=True)
+        self.assertIn("Some error", report["report_metadata"]["errors"])
+
+    def testAggregateReportPolicyDefaults(self):
+        """Policy defaults: adkim/aspf='r', sp=p, pct/fo=None"""
+        xml = """<?xml version="1.0"?>
+        <feedback>
+            <report_metadata>
+                <org_name>TestOrg</org_name>
+                <email>test@example.com</email>
+                <report_id>test-defaults</report_id>
+                <date_range><begin>1704067200</begin><end>1704153599</end></date_range>
+            </report_metadata>
+            <policy_published>
+                <domain>example.com</domain>
+                <p>reject</p>
+            </policy_published>
+            <record>
+                <row>
+                    <source_ip>192.0.2.1</source_ip>
+                    <count>1</count>
+                    <policy_evaluated><disposition>none</disposition><dkim>pass</dkim><spf>pass</spf></policy_evaluated>
+                </row>
+                <identifiers><header_from>example.com</header_from></identifiers>
+                <auth_results><spf><domain>example.com</domain><result>pass</result></spf></auth_results>
+            </record>
+        </feedback>"""
+        report = parsedmarc.parse_aggregate_report_xml(xml, offline=True)
+        pp = report["policy_published"]
+        self.assertEqual(pp["adkim"], "r")
+        self.assertEqual(pp["aspf"], "r")
+        self.assertEqual(pp["sp"], "reject")  # defaults to p
+        self.assertIsNone(pp["pct"])
+        self.assertIsNone(pp["fo"])
+        self.assertIsNone(pp["np"])
+        self.assertIsNone(pp["testing"])
+        self.assertIsNone(pp["discovery_method"])
+
+    def testMagicXmlTagDetection(self):
+        """XML without declaration (starting with '<') is extracted"""
+        xml_no_decl = b"<feedback><report_metadata><org_name>T</org_name><email>a@b.com</email><report_id>r1</report_id><date_range><begin>1704067200</begin><end>1704153599</end></date_range></report_metadata><policy_published><domain>example.com</domain><p>none</p></policy_published><record><row><source_ip>192.0.2.1</source_ip><count>1</count><policy_evaluated><disposition>none</disposition><dkim>pass</dkim><spf>pass</spf></policy_evaluated></row><identifiers><header_from>example.com</header_from></identifiers><auth_results><spf><domain>example.com</domain><result>pass</result></spf></auth_results></record></feedback>"
+        self.assertTrue(xml_no_decl.startswith(parsedmarc.MAGIC_XML_TAG))
+        # Ensure it extracts as XML
+        result = parsedmarc.extract_report(xml_no_decl)
+        self.assertIn("<feedback>", result)
+
+    def testSmtpTlsCsvRows(self):
+        """parsed_smtp_tls_reports_to_csv_rows produces correct rows"""
+        report_json = json.dumps(
+            {
+                "organization-name": "Org",
+                "date-range": {
+                    "start-datetime": "2024-01-01T00:00:00Z",
+                    "end-datetime": "2024-01-02T00:00:00Z",
+                },
+                "contact-info": "a@b.com",
+                "report-id": "r1",
+                "policies": [
+                    {
+                        "policy": {
+                            "policy-type": "sts",
+                            "policy-domain": "example.com",
+                            "policy-string": ["v: STSv1"],
+                            "mx-host-pattern": ["*.example.com"],
+                        },
+                        "summary": {
+                            "total-successful-session-count": 10,
+                            "total-failure-session-count": 1,
+                        },
+                        "failure-details": [
+                            {"result-type": "cert-expired", "failed-session-count": 1}
+                        ],
+                    }
+                ],
+            }
+        )
+        parsed = parsedmarc.parse_smtp_tls_report_json(report_json)
+        rows = parsedmarc.parsed_smtp_tls_reports_to_csv_rows(parsed)
+        self.assertTrue(len(rows) >= 2)
+        self.assertEqual(rows[0]["organization_name"], "Org")
+        self.assertEqual(rows[0]["policy_domain"], "example.com")
+
+    def testParsedAggregateReportsToCsvRowsList(self):
+        """parsed_aggregate_reports_to_csv_rows handles list of reports"""
+        result = parsedmarc.parse_report_file(
+            "samples/aggregate/rfc9990-sample.xml",
+            always_use_local_files=True,
+            offline=True,
+        )
+        report = cast(AggregateReport, result["report"])
+        # Pass as a list
+        rows = parsedmarc.parsed_aggregate_reports_to_csv_rows([report])
+        self.assertTrue(len(rows) > 0)
+        # Verify every value was cleaned to a str, int, or bool
+        for row in rows:
+            for v in row.values():
+                self.assertIn(type(v), [str, int, bool])
+
+    def testExceptionHierarchy(self):
+        """Exception class hierarchy is correct"""
+        self.assertTrue(issubclass(parsedmarc.ParserError, RuntimeError))
+        self.assertTrue(
+            issubclass(parsedmarc.InvalidDMARCReport, parsedmarc.ParserError)
+        )
+        self.assertTrue(
+            issubclass(parsedmarc.InvalidAggregateReport, parsedmarc.InvalidDMARCReport)
+        )
+        self.assertTrue(
+            issubclass(parsedmarc.InvalidFailureReport, parsedmarc.InvalidDMARCReport)
+        )
+        self.assertTrue(
+            issubclass(parsedmarc.InvalidSMTPTLSReport, parsedmarc.ParserError)
+        )
+        self.assertIs(parsedmarc.InvalidForensicReport, parsedmarc.InvalidFailureReport)
+
+    def testAggregateReportNormalization(self):
+        """Reports spanning >24h get normalized per day"""
+        xml = """<?xml version="1.0"?>
+        <feedback>
+            <report_metadata>
+                <org_name>TestOrg</org_name>
+                <email>test@example.com</email>
+                <report_id>test-norm</report_id>
+                <date_range><begin>1704067200</begin><end>1704326400</end></date_range>
+            </report_metadata>
+            <policy_published>
+                <domain>example.com</domain>
+                <p>none</p>
+            </policy_published>
+            <record>
+                <row>
+                    <source_ip>192.0.2.1</source_ip>
+                    <count>90</count>
+                    <policy_evaluated><disposition>none</disposition><dkim>pass</dkim><spf>pass</spf></policy_evaluated>
+                </row>
+                <identifiers><header_from>example.com</header_from></identifiers>
+                <auth_results><spf><domain>example.com</domain><result>pass</result></spf></auth_results>
+            </record>
+        </feedback>"""
+        # Span is 259200 seconds (3 days), exceeds default 24h threshold
+        report = parsedmarc.parse_aggregate_report_xml(xml, offline=True)
+        self.assertTrue(report["report_metadata"]["timespan_requires_normalization"])
+        # Records should be split across days
+        self.assertTrue(len(report["records"]) > 1)
+        total = sum(r["count"] for r in report["records"])
+        self.assertEqual(total, 90)
+        for r in report["records"]:
+            self.assertTrue(r["normalized_timespan"])  # type: ignore[typeddict-item]
+
+    def testExtractReportFromFilePathNotFound(self):
+        """extract_report_from_file_path raises ParserError for missing file"""
+        with self.assertRaises(parsedmarc.ParserError):
+            parsedmarc.extract_report_from_file_path("nonexistent_file.xml")
+
+    def testExtractReportInvalidArchive(self):
+        """extract_report raises ParserError for unrecognized binary content"""
+        with self.assertRaises(parsedmarc.ParserError):
+            parsedmarc.extract_report(b"\x00\x01\x02\x03\x04\x05\x06\x07")
+
+    def testParseAggregateReportFile(self):
+        """parse_aggregate_report_file parses bytes input directly"""
+        print()
+        sample_path = "samples/aggregate/rfc9990-sample.xml"
+        print("Testing {0}: ".format(sample_path), end="")
+        with open(sample_path, "rb") as f:
+            data = f.read()
+        report = parsedmarc.parse_aggregate_report_file(
+            data,
+            offline=True,
+            always_use_local_files=True,
+        )
+        self.assertEqual(report["report_metadata"]["org_name"], "Sample Reporter")
+        self.assertEqual(report["policy_published"]["domain"], "example.com")
+        print("Passed!")
+
+    def testParseInvalidAggregateSample(self):
+        """Test invalid aggregate samples are handled"""
+        print()
+        sample_paths = glob("samples/aggregate_invalid/*")
+        for sample_path in sample_paths:
+            if os.path.isdir(sample_path):
+                continue
+            print("Testing {0}: ".format(sample_path), end="")
+            with self.subTest(sample=sample_path):
+                parsed_report = cast(
+                    AggregateReport,
+                    parsedmarc.parse_report_file(
+                        sample_path, always_use_local_files=True, offline=OFFLINE_MODE
+                    )["report"],
+                )
+                parsedmarc.parsed_aggregate_reports_to_csv(parsed_report)
+            print("Passed!")
+
+    def testParseReportFileWithBytes(self):
+        """parse_report_file handles bytes input"""
+        with open("samples/aggregate/rfc9990-sample.xml", "rb") as f:
+            data = f.read()
+        result = parsedmarc.parse_report_file(
+            data, always_use_local_files=True, offline=True
+        )
+        self.assertEqual(result["report_type"], "aggregate")
+
+    def testFailureReportCsvRoundtrip(self):
+        """Failure report CSV generation works on sample reports"""
+        print()
+        sample_paths = glob("samples/failure/*.eml")
+        for sample_path in sample_paths:
+            print("Testing CSV for {0}: ".format(sample_path), end="")
+            with self.subTest(sample=sample_path):
+                parsed_report = cast(
+                    FailureReport,
+                    parsedmarc.parse_report_file(sample_path, offline=OFFLINE_MODE)[
+                        "report"
+                    ],
+                )
+                csv_output = parsedmarc.parsed_failure_reports_to_csv(parsed_report)
+                self.assertIsNotNone(csv_output)
+                self.assertIn(",", csv_output)
+                rows = parsedmarc.parsed_failure_reports_to_csv_rows(parsed_report)
+                self.assertTrue(len(rows) > 0)
+            print("Passed!")
+
+
+class TestExtractReport(unittest.TestCase):
+    """Tests for parsedmarc.extract_report()"""
+
+    def testExtractReportFromBytes(self):
+        """extract_report handles raw XML bytes"""
+        xml = b'<?xml version="1.0"?><feedback><report_metadata></report_metadata></feedback>'
+        result = parsedmarc.extract_report(xml)
+        self.assertIn("<feedback>", result)
+
+    def testExtractReportFromBase64Xml(self):
+        """extract_report handles base64-encoded XML string"""
+        import base64
+
+        xml = b'<?xml version="1.0"?><feedback></feedback>'
+        b64 = base64.b64encode(xml).decode()
+        result = parsedmarc.extract_report(b64)
+        self.assertIn("<feedback>", result)
+
+    def testExtractReportFromGzip(self):
+        """extract_report handles gzip compressed content"""
+        import gzip
+
+        xml = b'<?xml version="1.0"?><feedback></feedback>'
+        compressed = gzip.compress(xml)
+        result = parsedmarc.extract_report(compressed)
+        self.assertIn("<feedback>", result)
+
+    def testExtractReportFromZip(self):
+        """extract_report handles zip compressed content"""
+        import zipfile
+
+        xml = b'<?xml version="1.0"?><feedback></feedback>'
+        buf = BytesIO()
+        with zipfile.ZipFile(buf, "w") as zf:
+            zf.writestr("report.xml", xml)
+        result = parsedmarc.extract_report(buf.getvalue())
+        self.assertIn("<feedback>", result)
+
+    def testExtractReportFromBinaryIO(self):
+        """extract_report handles file-like BinaryIO objects"""
+        xml = b'<?xml version="1.0"?><feedback></feedback>'
+        bio = BytesIO(xml)
+        result = parsedmarc.extract_report(bio)
+        self.assertIn("<feedback>", result)
+
+    def testExtractReportFromNonSeekableStream(self):
+        """extract_report handles non-seekable streams"""
+        xml = b'<?xml version="1.0"?><feedback></feedback>'
+
+        class NonSeekable:
+            def __init__(self, data):
+                self._data = data
+                self._pos = 0
+
+            def read(self, n=-1):
+                if n == -1:
+                    result = self._data[self._pos :]
+                    self._pos = len(self._data)
+                else:
+                    result = self._data[self._pos : self._pos + n]
+                    self._pos += n
+                return result
+
+            def seekable(self):
+                return False
+
+            def close(self):
+                pass
+
+        result = parsedmarc.extract_report(cast(BinaryIO, NonSeekable(xml)))
+        self.assertIn("<feedback>", result)
+
+    def testExtractReportInvalidContent(self):
+        """extract_report raises ParserError for invalid content"""
+        with self.assertRaises(parsedmarc.ParserError):
+            parsedmarc.extract_report(b"this is not a valid archive")
+
+    def testExtractReportTextModeRaises(self):
+        """extract_report raises ParserError for text-mode streams"""
+
+        class TextStream:
+            def read(self, n=-1):
+                return "text data"
+
+            def seekable(self):
+                return True
+
+            def seek(self, pos):
+                pass
+
+            def close(self):
+                pass
+
+        with self.assertRaises(parsedmarc.ParserError):
+            parsedmarc.extract_report(cast(BinaryIO, TextStream()))
+
+
+class TestMalformedXmlRecovery(unittest.TestCase):
+    """Tests for XML recovery in parse_aggregate_report_xml"""
+
+    def testRecoversMalformedXml(self):
+        """Malformed XML triggers recovery path and still parses"""
+        # XML with a broken tag that xmltodict will reject but lxml can recover
+        malformed_xml = """<?xml version="1.0"?>
+<feedback>
+  <report_metadata>
+    <org_name>example.com</org_name>
+    <email>dmarc@example.com</email>
+    <report_id>12345</report_id>
+    <date_range><begin>1680000000</begin><end>1680086400</end></date_range>
+  </report_metadata>
+  <policy_published>
+    <domain>example.com</domain><p>none</p>
+  </policy_published>
+  <record>
+    <row><source_ip>203.0.113.1</source_ip><count>1</count>
+      <policy_evaluated><disposition>none</disposition><dkim>pass</dkim><spf>pass</spf></policy_evaluated>
+    </row>
+    <identifiers><header_from>example.com</header_from></identifiers>
+    <auth_results><spf><domain>example.com</domain><result>pass</result></spf></auth_results>
+  </record>
+  <broken_tag
+</feedback>"""
+        # lxml recovery may succeed or fail depending on how broken the XML is
+        # Either way, no unhandled exception should escape
+        try:
+            report = parsedmarc.parse_aggregate_report_xml(malformed_xml, offline=True)
+            self.assertIn("report_metadata", report)
+        except parsedmarc.InvalidAggregateReport:
+            pass  # Also acceptable
+
+    def testBytesXmlInput(self):
+        """XML decoded from bytes parses correctly"""
+        xml = b"""<?xml version="1.0"?>
+<feedback>
+  <report_metadata>
+    <org_name>example.com</org_name>
+    <email>dmarc@example.com</email>
+    <report_id>test-bytes-input</report_id>
+    <date_range><begin>1680000000</begin><end>1680086400</end></date_range>
+  </report_metadata>
+  <policy_published>
+    <domain>example.com</domain><p>none</p>
+  </policy_published>
+  <record>
+    <row><source_ip>203.0.113.1</source_ip><count>1</count>
+      <policy_evaluated><disposition>none</disposition><dkim>pass</dkim><spf>pass</spf></policy_evaluated>
+    </row>
+    <identifiers><header_from>example.com</header_from></identifiers>
+    <auth_results><spf><domain>example.com</domain><result>pass</result></spf></auth_results>
+  </record>
+</feedback>"""
+        report = parsedmarc.parse_aggregate_report_xml(xml.decode(), offline=True)
+        self.assertEqual(report["report_metadata"]["report_id"], "test-bytes-input")
+
+    def testExpatErrorRaises(self):
+        """Completely invalid XML raises InvalidAggregateReport"""
+        with self.assertRaises(parsedmarc.InvalidAggregateReport):
+            parsedmarc.parse_aggregate_report_xml("not xml at all {}", offline=True)
+
+    def testMissingOrgName(self):
+        """Missing org_name raises InvalidAggregateReport"""
+        xml = """<?xml version="1.0"?>
+<feedback>
+  <report_metadata>
+    <email>dmarc@example.com</email>
+    <report_id>missing-org</report_id>
+    <date_range><begin>1680000000</begin><end>1680086400</end></date_range>
+  </report_metadata>
+  <policy_published><domain>example.com</domain><p>none</p></policy_published>
+  <record>
+    <row><source_ip>1.2.3.4</source_ip><count>1</count>
+      <policy_evaluated><disposition>none</disposition><dkim>pass</dkim><spf>pass</spf></policy_evaluated>
+    </row>
+    <identifiers><header_from>example.com</header_from></identifiers>
+    <auth_results><spf><domain>example.com</domain><result>pass</result></spf></auth_results>
+  </record>
+</feedback>"""
+        with self.assertRaises(parsedmarc.InvalidAggregateReport):
+            parsedmarc.parse_aggregate_report_xml(xml, offline=True)
+
+
+class TestPolicyPublishedEdgeCases(unittest.TestCase):
+    """Tests for edge cases in policy_published parsing"""
+
+    VALID_XML_TEMPLATE = """<?xml version="1.0"?>
+<feedback>
+  <report_metadata>
+    <org_name>example.com</org_name>
+    <email>dmarc@example.com</email>
+    <report_id>test-{tag}</report_id>
+    <date_range><begin>1680000000</begin><end>1680086400</end></date_range>
+    {extra_metadata}
+  </report_metadata>
+  <policy_published>
+    <domain>example.com</domain><p>reject</p>
+    {policy_extra}
+  </policy_published>
+  <record>
+    <row><source_ip>203.0.113.1</source_ip><count>1</count>
+      <policy_evaluated><disposition>none</disposition><dkim>pass</dkim><spf>pass</spf></policy_evaluated>
+    </row>
+    <identifiers><header_from>example.com</header_from></identifiers>
+    <auth_results><spf><domain>example.com</domain><result>pass</result></spf></auth_results>
+  </record>
+</feedback>"""
+
+    def _parse(self, tag="default", policy_extra="", extra_metadata=""):
+        xml = self.VALID_XML_TEMPLATE.format(
+            tag=tag, policy_extra=policy_extra, extra_metadata=extra_metadata
+        )
+        return parsedmarc.parse_aggregate_report_xml(xml, offline=True)
+
+    def testPolicyPublishedListHandled(self):
+        """policy_published as a list uses first element"""
+        # The code checks `if type(policy_published) is list`
+        # This is tested implicitly when xmltodict returns a list;
+        # we test via the np field presence
+        report = self._parse(tag="np", policy_extra="<np>quarantine</np>")
+        self.assertEqual(report["policy_published"]["np"], "quarantine")
+
+    def testNpFieldValues(self):
+        """np field is parsed correctly"""
+        for val in ["none", "quarantine", "reject"]:
+            report = self._parse(tag=f"np-{val}", policy_extra=f"<np>{val}</np>")
+            self.assertEqual(report["policy_published"]["np"], val)
+
+    def testTestingField(self):
+        """testing field is parsed correctly"""
+        for val in ["y", "n"]:
+            report = self._parse(
+                tag=f"testing-{val}", policy_extra=f"<testing>{val}</testing>"
+            )
+            self.assertEqual(report["policy_published"]["testing"], val)
+
+    def testDiscoveryMethodField(self):
+        """discovery_method field is parsed correctly"""
+        for val in ["psl", "treewalk"]:
+            report = self._parse(
+                tag=f"disc-{val}",
+                policy_extra=f"<discovery_method>{val}</discovery_method>",
+            )
+            self.assertEqual(report["policy_published"]["discovery_method"], val)
+
+    def testGeneratorField(self):
+        """generator field in report_metadata is parsed"""
+        report = self._parse(
+            tag="gen", extra_metadata="<generator>TestGen/1.0</generator>"
+        )
+        self.assertEqual(report["report_metadata"]["generator"], "TestGen/1.0")
+
+    def testPctFieldNone(self):
+        """pct defaults to None when absent (removed in RFC 9989)"""
+        report = self._parse(tag="no-pct")
+        self.assertIsNone(report["policy_published"]["pct"])
+
+    def testFoFieldNone(self):
+        """fo defaults to None when absent (RFC 9990 keeps it optional)"""
+        report = self._parse(tag="no-fo")
+        self.assertIsNone(report["policy_published"]["fo"])
+
+    def testReportMetadataErrors(self):
+        """Report metadata errors are captured"""
+        report = self._parse(
+            tag="errors",
+            extra_metadata="<error>DNS timeout</error>",
+        )
+        self.assertIn("DNS timeout", report["report_metadata"]["errors"])
+
+    def testReportMetadataErrorsList(self):
+        """Report metadata errors as list are captured"""
+        report = self._parse(
+            tag="errors-list",
+            extra_metadata="<error>error1</error><error>error2</error>",
+        )
+        self.assertIn("error1", report["report_metadata"]["errors"])
+        self.assertIn("error2", report["report_metadata"]["errors"])
+
+    def testRecordParseFailureSkipped(self):
+        """Bad records are skipped with a warning instead of crashing"""
+        xml = """<?xml version="1.0"?>
+<feedback>
+  <report_metadata>
+    <org_name>example.com</org_name>
+    <email>dmarc@example.com</email>
+    <report_id>bad-records</report_id>
+    <date_range><begin>1680000000</begin><end>1680086400</end></date_range>
+  </report_metadata>
+  <policy_published><domain>example.com</domain><p>none</p></policy_published>
+  <record>
+    <row><source_ip>203.0.113.1</source_ip><count>1</count>
+      <policy_evaluated><disposition>none</disposition><dkim>pass</dkim><spf>pass</spf></policy_evaluated>
+    </row>
+    <identifiers><header_from>example.com</header_from></identifiers>
+    <auth_results><spf><domain>example.com</domain><result>pass</result></spf></auth_results>
+  </record>
+  <record>
+    <row><source_ip>bad-ip</source_ip><count>not-a-number</count>
+      <policy_evaluated><disposition>none</disposition><dkim>pass</dkim><spf>pass</spf></policy_evaluated>
+    </row>
+    <identifiers><header_from>example.com</header_from></identifiers>
+    <auth_results><spf><domain>example.com</domain><result>pass</result></spf></auth_results>
+  </record>
+</feedback>"""
+        report = parsedmarc.parse_aggregate_report_xml(xml, offline=True)
+        # At least the valid record should be parsed
+        self.assertTrue(len(report["records"]) >= 1)
+
+
+class TestParseReportFile(unittest.TestCase):
+    """Tests for parse_report_file with various input types"""
+
+    def testParseReportFileFromBytes(self):
+        """parse_report_file works with bytes input"""
+        xml_path = "samples/aggregate/!example.com!1538204542!1538463818.xml"
+        with open(xml_path, "rb") as f:
+            content = f.read()
+        result = parsedmarc.parse_report_file(content, offline=True)
+        self.assertEqual(result["report_type"], "aggregate")
+
+    def testParseReportFileFromBinaryIO(self):
+        """parse_report_file works with BinaryIO input"""
+        xml_path = "samples/aggregate/!example.com!1538204542!1538463818.xml"
+        with open(xml_path, "rb") as f:
+            result = parsedmarc.parse_report_file(f, offline=True)
+        self.assertEqual(result["report_type"], "aggregate")
+
+    def testParseReportFileFromPathlib(self):
+        """parse_report_file works with pathlib.Path input"""
+        xml_path = Path("samples/aggregate/!example.com!1538204542!1538463818.xml")
+        result = parsedmarc.parse_report_file(xml_path, offline=True)
+        self.assertEqual(result["report_type"], "aggregate")
+
+    def testParseReportFileSmtpTls(self):
+        """parse_report_file detects SMTP TLS reports"""
+        result = parsedmarc.parse_report_file(
+            "samples/smtp_tls/smtp_tls.json", offline=True
+        )
+        self.assertEqual(result["report_type"], "smtp_tls")
+
+    def testParseReportFileEmail(self):
+        """parse_report_file detects failure reports in email format"""
+        eml_path = "samples/failure/dmarc_ruf_report_linkedin.eml"
+        result = parsedmarc.parse_report_file(eml_path, offline=True)
+        self.assertEqual(result["report_type"], "failure")
+
+    def testParseReportFileInvalid(self):
+        """parse_report_file raises ParserError for invalid content"""
+        with self.assertRaises(parsedmarc.ParserError):
+            parsedmarc.parse_report_file(b"this is not a report", offline=True)
+
+
+class TestParseReportEmail(unittest.TestCase):
+    """Tests for parse_report_email edge cases"""
+
+    def testSmtpTlsEmailReport(self):
+        """parse_report_email handles SMTP TLS reports in email format"""
+        eml_path = "samples/smtp_tls/google.com_smtp_tls_report.eml"
+        with open(eml_path, "rb") as f:
+            content = f.read()
+        result = parsedmarc.parse_report_email(content, offline=True)
+        self.assertEqual(result["report_type"], "smtp_tls")
+
+    def testInvalidEmailRaisesError(self):
+        """parse_report_email raises error for non-DMARC email"""
+        email_str = """From: test@example.com
+Subject: Hello World
+Content-Type: text/plain
+
+This is not a DMARC report."""
+        with self.assertRaises(parsedmarc.InvalidDMARCReport):
+            parsedmarc.parse_report_email(email_str, offline=True)
+
+
+class TestFailureReportParsing(unittest.TestCase):
+    """Tests for failure report field defaults and edge cases"""
+
+    def _make_feedback_report(self, **overrides):
+        """Create a minimal feedback report string"""
+        fields = {
+            "Feedback-Type": "auth-failure",
+            "User-Agent": "test/1.0",
+            "Version": "1",
+            "Original-Mail-From": "sender@example.com",
+            "Arrival-Date": "Thu, 1 Jan 2024 00:00:00 +0000",
+            "Source-IP": "203.0.113.1",
+            "Reported-Domain": "example.com",
+            "Auth-Failure": "dmarc",
+        }
+        fields.update(overrides)
+        return "\n".join(f"{k}: {v}" for k, v in fields.items())
+
+    def _make_sample(self):
+        return """From: sender@example.com
+To: recipient@example.com
+Subject: Test
+Date: Thu, 1 Jan 2024 00:00:00 +0000
+
+Test body"""
+
+    def _default_msg_date(self):
+        return datetime(2024, 1, 1, 0, 0, 0, tzinfo=timezone.utc)
+
+    def testMissingVersion(self):
+        """Missing version defaults to None"""
+        report_str = self._make_feedback_report()
+        lines = [ln for ln in report_str.split("\n") if not ln.startswith("Version:")]
+        report_str = "\n".join(lines)
+        report = parsedmarc.parse_failure_report(
+            report_str, self._make_sample(), self._default_msg_date(), offline=True
+        )
+        self.assertIsNone(report["version"])
+
+    def testMissingUserAgent(self):
+        """Missing user_agent defaults to None"""
+        report_str = self._make_feedback_report()
+        lines = [
+            ln for ln in report_str.split("\n") if not ln.startswith("User-Agent:")
+        ]
+        report_str = "\n".join(lines)
+        report = parsedmarc.parse_failure_report(
+            report_str, self._make_sample(), self._default_msg_date(), offline=True
+        )
+        self.assertIsNone(report["user_agent"])
+
+    def testMissingDeliveryResult(self):
+        """An absent Delivery-Result field maps to 'other'"""
+        report_str = self._make_feedback_report()
+        report = parsedmarc.parse_failure_report(
+            report_str, self._make_sample(), self._default_msg_date(), offline=True
+        )
+        # An absent Delivery-Result is first set to None; the validation check
+        # then maps None (not in the delivery_results list) to "other"
+        self.assertEqual(report["delivery_result"], "other")
+
+    def testDeliveryResultMapped(self):
+        """Known delivery_result values are mapped correctly"""
+        for val in ["delivered", "spam", "policy", "reject"]:
+            report_str = self._make_feedback_report(**{"Delivery-Result": val})
+            report = parsedmarc.parse_failure_report(
+                report_str, self._make_sample(), self._default_msg_date(), offline=True
+            )
+            self.assertEqual(report["delivery_result"], val)
+
+    def testDeliveryResultUnknownMapsToOther(self):
+        """Unknown delivery_result maps to 'other'"""
+        report_str = self._make_feedback_report(**{"Delivery-Result": "unknown-value"})
+        report = parsedmarc.parse_failure_report(
+            report_str, self._make_sample(), self._default_msg_date(), offline=True
+        )
+        self.assertEqual(report["delivery_result"], "other")
+
+    def testIdentityAlignmentNone(self):
+        """identity_alignment='none' results in empty auth mechanisms"""
+        report_str = self._make_feedback_report(**{"Identity-Alignment": "none"})
+        report = parsedmarc.parse_failure_report(
+            report_str, self._make_sample(), self._default_msg_date(), offline=True
+        )
+        self.assertEqual(report["authentication_mechanisms"], [])
+
+    def testIdentityAlignmentMultiple(self):
+        """identity_alignment with multiple values is split"""
+        report_str = self._make_feedback_report(**{"Identity-Alignment": "dkim,spf"})
+        report = parsedmarc.parse_failure_report(
+            report_str, self._make_sample(), self._default_msg_date(), offline=True
+        )
+        self.assertEqual(report["authentication_mechanisms"], ["dkim", "spf"])
+
+    def testIdentityAlignmentCFWSWhitespaceStripped(self):
+        """RFC 9991 ABNF allows CFWS around the commas in
+        Identity-Alignment. The previous parser left leading whitespace
+        on the second token ('dkim, spf' -> ['dkim', ' spf']); CFWS-aware
+        splitting yields ['dkim', 'spf']."""
+        report_str = self._make_feedback_report(**{"Identity-Alignment": "dkim, spf"})
+        report = parsedmarc.parse_failure_report(
+            report_str, self._make_sample(), self._default_msg_date(), offline=True
+        )
+        self.assertEqual(report["authentication_mechanisms"], ["dkim", "spf"])
+
+    def testAuthFailureCFWSWhitespaceStripped(self):
+        """Auth-Failure (also comma-separated per RFC 9991) is whitespace-
+        stripped per token."""
+        report_str = self._make_feedback_report(**{"Auth-Failure": "dmarc, spf"})
+        report = parsedmarc.parse_failure_report(
+            report_str, self._make_sample(), self._default_msg_date(), offline=True
+        )
+        self.assertEqual(report["auth_failure"], ["dmarc", "spf"])
+
+    def testMissingIdentityAlignmentWarns(self):
+        """Identity-Alignment is REQUIRED per RFC 9991; the parser
+        defaults to an empty list for permissiveness but logs a warning
+        so the broken reporter is visible."""
+        report_str = self._make_feedback_report()
+        lines = [
+            ln
+            for ln in report_str.split("\n")
+            if not ln.startswith("Identity-Alignment:")
+        ]
+        report_str = "\n".join(lines)
+        with self.assertLogs("parsedmarc.log", level="WARNING") as cm:
+            report = parsedmarc.parse_failure_report(
+                report_str,
+                self._make_sample(),
+                self._default_msg_date(),
+                offline=True,
+            )
+        self.assertEqual(report["authentication_mechanisms"], [])
+        self.assertTrue(
+            any("Identity-Alignment" in m and "RFC 9991" in m for m in cm.output),
+            f"Expected Identity-Alignment RFC 9991 warning; got: {cm.output}",
+        )
+
+    def testMissingAuthFailureWarns(self):
+        """Auth-Failure is REQUIRED per RFC 9991; the parser defaults
+        to 'dmarc' but logs a warning."""
+        report_str = self._make_feedback_report()
+        lines = [
+            ln for ln in report_str.split("\n") if not ln.startswith("Auth-Failure:")
+        ]
+        report_str = "\n".join(lines)
+        with self.assertLogs("parsedmarc.log", level="WARNING") as cm:
+            report = parsedmarc.parse_failure_report(
+                report_str,
+                self._make_sample(),
+                self._default_msg_date(),
+                offline=True,
+            )
+        self.assertEqual(report["auth_failure"], ["dmarc"])
+        self.assertTrue(
+            any("Auth-Failure" in m and "RFC 9991" in m for m in cm.output),
+            f"Expected Auth-Failure RFC 9991 warning; got: {cm.output}",
+        )
+
+    def testMissingReportedDomainFallback(self):
+        """Missing reported_domain falls back to the sample's From domain"""
+        report_str = self._make_feedback_report()
+        lines = [
+            ln for ln in report_str.split("\n") if not ln.startswith("Reported-Domain:")
+        ]
+        report_str = "\n".join(lines)
+        report = parsedmarc.parse_failure_report(
+            report_str, self._make_sample(), self._default_msg_date(), offline=True
+        )
+        self.assertEqual(report["reported_domain"], "example.com")
+
+    def testMissingArrivalDateWithMsgDate(self):
+        """Missing arrival_date uses msg_date fallback"""
+        report_str = self._make_feedback_report()
+        lines = [
+            ln for ln in report_str.split("\n") if not ln.startswith("Arrival-Date:")
+        ]
+        report_str = "\n".join(lines)
+        msg_date = datetime(2024, 6, 15, 12, 0, 0, tzinfo=timezone.utc)
+        report = parsedmarc.parse_failure_report(
+            report_str, self._make_sample(), msg_date, offline=True
+        )
+        self.assertIn("2024-06-15", report["arrival_date"])
+
+    def testMissingArrivalDateNoMsgDateRaises(self):
+        """Missing arrival_date with no msg_date raises"""
+        report_str = self._make_feedback_report()
+        lines = [
+            ln for ln in report_str.split("\n") if not ln.startswith("Arrival-Date:")
+        ]
+        report_str = "\n".join(lines)
+        with self.assertRaises(parsedmarc.InvalidFailureReport):
+            parsedmarc.parse_failure_report(
+                report_str,
+                self._make_sample(),
+                cast(datetime, None),  # intentionally None to test error path
+                offline=True,
+            )
+
+
+class TestSmtpTlsReportErrors(unittest.TestCase):
+    """Tests for SMTP TLS report error handling"""
+
+    def testMissingRequiredField(self):
+        """Missing required field raises InvalidSMTPTLSReport"""
+        json_str = json.dumps({"policies": []})
+        with self.assertRaises(parsedmarc.InvalidSMTPTLSReport):
+            parsedmarc.parse_smtp_tls_report_json(json_str)
+
+    def testInvalidJson(self):
+        """Invalid JSON raises InvalidSMTPTLSReport"""
+        with self.assertRaises(parsedmarc.InvalidSMTPTLSReport):
+            parsedmarc.parse_smtp_tls_report_json("not json {{{")
+
+
+class TestBucketIntervalEdgeCases(unittest.TestCase):
+    """Tests for _bucket_interval_by_day edge cases"""
+
+    def testDayCursorAdjustment(self):
+        """When begin is before midnight due to tz, day_cursor adjusts back"""
+        # Use a timezone where midnight calculation might cause day_cursor > begin
+        import pytz
+
+        tz = pytz.FixedOffset(-600)  # UTC-10
+        begin = datetime(2024, 1, 1, 23, 30, 0, tzinfo=timezone.utc).astimezone(tz)
+        end = datetime(2024, 1, 3, 0, 0, 0, tzinfo=timezone.utc).astimezone(tz)
+        buckets = parsedmarc._bucket_interval_by_day(begin, end, 100)
+        total = sum(b["count"] for b in buckets)
+        self.assertEqual(total, 100)
+
+
+class TestGetDmarcReportsFromMbox(unittest.TestCase):
+    """Tests for mbox parsing"""
+
+    def testEmptyMbox(self):
+        """Empty mbox returns empty results"""
+        with NamedTemporaryFile(suffix=".mbox", delete=False) as f:
+            f.write(b"")
+            path = f.name
+        try:
+            results = parsedmarc.get_dmarc_reports_from_mbox(path, offline=True)
+            self.assertEqual(results["aggregate_reports"], [])
+            self.assertEqual(results["failure_reports"], [])
+            self.assertEqual(results["smtp_tls_reports"], [])
+        finally:
+            os.remove(path)
+
+    def testMboxWithAggregateReport(self):
+        """Mbox with aggregate report email is parsed"""
+        from email.mime.multipart import MIMEMultipart
+        from email.mime.application import MIMEApplication
+        import gzip
+
+        xml = b"""<?xml version="1.0"?>
+<feedback>
+  <report_metadata>
+    <org_name>example.com</org_name>
+    <email>dmarc@example.com</email>
+    <report_id>mbox-test-123</report_id>
+    <date_range><begin>1680000000</begin><end>1680086400</end></date_range>
+  </report_metadata>
+  <policy_published><domain>example.com</domain><p>none</p></policy_published>
+  <record>
+    <row><source_ip>203.0.113.1</source_ip><count>1</count>
+      <policy_evaluated><disposition>none</disposition><dkim>pass</dkim><spf>pass</spf></policy_evaluated>
+    </row>
+    <identifiers><header_from>example.com</header_from></identifiers>
+    <auth_results><spf><domain>example.com</domain><result>pass</result></spf></auth_results>
+  </record>
+</feedback>"""
+        compressed = gzip.compress(xml)
+
+        msg = MIMEMultipart()
+        msg["From"] = "dmarc@example.com"
+        msg["To"] = "postmaster@example.com"
+        msg["Subject"] = "DMARC Aggregate Report"
+        msg["Date"] = "Mon, 1 Jan 2024 00:00:00 +0000"
+        att = MIMEApplication(compressed, "gzip")
+        att.add_header("Content-Disposition", "attachment", filename="report.xml.gz")
+        msg.attach(att)
+
+        with NamedTemporaryFile(suffix=".mbox", delete=False, mode="w") as f:
+            # mbox format requires "From " line
+            f.write("From dmarc@example.com Mon Jan  1 00:00:00 2024\n")
+            f.write(msg.as_string())
+            f.write("\n")
+            path = f.name
+        try:
+            results = parsedmarc.get_dmarc_reports_from_mbox(path, offline=True)
+            self.assertGreaterEqual(len(results["aggregate_reports"]), 1)
+        finally:
+            os.remove(path)
+
+
+if __name__ == "__main__":
+    unittest.main(verbosity=2)
diff --git a/tests/test_kafkaclient.py b/tests/test_kafkaclient.py
new file mode 100644
index 0000000..ba90530
--- /dev/null
+++ b/tests/test_kafkaclient.py
@@ -0,0 +1,58 @@
+"""Tests for parsedmarc.kafkaclient"""
+
+import unittest
+
+
+class Test(unittest.TestCase):
+    """Kitchen-sink tests redistributed from the original
+    tests.py monolith. Future PRs should split these further
+    into purpose-specific TestCase subclasses as natural
+    groupings emerge."""
+
+    def testKafkaStripMetadata(self):
+        """KafkaClient.strip_metadata extracts metadata to root"""
+        from parsedmarc.kafkaclient import KafkaClient
+
+        report = {
+            "report_metadata": {
+                "org_name": "TestOrg",
+                "org_email": "test@example.com",
+                "report_id": "r-123",
+                "begin_date": "2024-01-01",
+                "end_date": "2024-01-02",
+            },
+            "records": [],
+        }
+        result = KafkaClient.strip_metadata(report)
+        self.assertEqual(result["org_name"], "TestOrg")
+        self.assertEqual(result["org_email"], "test@example.com")
+        self.assertEqual(result["report_id"], "r-123")
+        self.assertNotIn("report_metadata", result)
+
+    def testKafkaGenerateDateRange(self):
+        """KafkaClient.generate_date_range generates date range list"""
+        from parsedmarc.kafkaclient import KafkaClient
+
+        report = {
+            "report_metadata": {
+                "begin_date": "2024-01-01 00:00:00",
+                "end_date": "2024-01-02 00:00:00",
+            }
+        }
+        result = KafkaClient.generate_date_range(report)
+        self.assertEqual(len(result), 2)
+        self.assertIn("2024-01-01", result[0])
+        self.assertIn("2024-01-02", result[1])
+
+    def testKafkaBackwardCompatAlias(self):
+        """KafkaClient forensic alias points to failure method"""
+        from parsedmarc.kafkaclient import KafkaClient
+
+        self.assertIs(
+            KafkaClient.save_forensic_reports_to_kafka,  # type: ignore[attr-defined]
+            KafkaClient.save_failure_reports_to_kafka,
+        )
+
+
+if __name__ == "__main__":
+    unittest.main(verbosity=2)
diff --git a/tests/test_loganalytics.py b/tests/test_loganalytics.py
new file mode 100644
index 0000000..a80c11b
--- /dev/null
+++ b/tests/test_loganalytics.py
@@ -0,0 +1,53 @@
+"""Tests for parsedmarc.loganalytics"""
+
+import unittest
+
+
+class Test(unittest.TestCase):
+    """Kitchen-sink tests redistributed from the original
+    tests.py monolith. Future PRs should split these further
+    into purpose-specific TestCase subclasses as natural
+    groupings emerge."""
+
+    def testLogAnalyticsConfig(self):
+        """LogAnalyticsConfig stores all fields"""
+        from parsedmarc.loganalytics import LogAnalyticsConfig
+
+        config = LogAnalyticsConfig(
+            client_id="cid",
+            client_secret="csec",
+            tenant_id="tid",
+            dce="https://dce.example.com",
+            dcr_immutable_id="dcr-123",
+            dcr_aggregate_stream="agg-stream",
+            dcr_failure_stream="fail-stream",
+            dcr_smtp_tls_stream="tls-stream",
+        )
+        self.assertEqual(config.client_id, "cid")
+        self.assertEqual(config.client_secret, "csec")
+        self.assertEqual(config.tenant_id, "tid")
+        self.assertEqual(config.dce, "https://dce.example.com")
+        self.assertEqual(config.dcr_immutable_id, "dcr-123")
+        self.assertEqual(config.dcr_aggregate_stream, "agg-stream")
+        self.assertEqual(config.dcr_failure_stream, "fail-stream")
+        self.assertEqual(config.dcr_smtp_tls_stream, "tls-stream")
+
+    def testLogAnalyticsClientValidationError(self):
+        """LogAnalyticsClient raises on missing required config"""
+        from parsedmarc.loganalytics import LogAnalyticsClient, LogAnalyticsException
+
+        with self.assertRaises(LogAnalyticsException):
+            LogAnalyticsClient(
+                client_id="",
+                client_secret="csec",
+                tenant_id="tid",
+                dce="https://dce.example.com",
+                dcr_immutable_id="dcr-123",
+                dcr_aggregate_stream="agg",
+                dcr_failure_stream="fail",
+                dcr_smtp_tls_stream="tls",
+            )
+
+
+if __name__ == "__main__":
+    unittest.main(verbosity=2)
diff --git a/tests/test_maps.py b/tests/test_maps.py
new file mode 100644
index 0000000..3696d14
--- /dev/null
+++ b/tests/test_maps.py
@@ -0,0 +1,142 @@
+"""Tests for the map-maintenance scripts under parsedmarc/resources/maps/.
+
+These scripts are maintainer-only batch tooling — they do not ship in the
+wheel — but they still need regression coverage because they enforce the
+privacy and integrity rules for the reverse-DNS map data files."""
+
+import unittest
+
+
+class TestMapScriptsIPDetection(unittest.TestCase):
+    """Full-IP detection and PSL folding in the map-maintenance scripts."""
+
+    def test_collect_domain_info_detects_full_ips(self):
+        import parsedmarc.resources.maps.collect_domain_info as cdi
+
+        # Dotted and dashed four-octet patterns with valid octets: detected.
+        self.assertTrue(cdi._has_full_ip("74-208-244-234.cprapid.com"))
+        self.assertTrue(cdi._has_full_ip("host.192.168.1.1.example.com"))
+        self.assertTrue(cdi._has_full_ip("a-10-20-30-40-brand.com"))
+        # Three octets is NOT a full IP — OVH's reverse-DNS pattern stays safe.
+        self.assertFalse(cdi._has_full_ip("ip-147-135-108.us"))
+        # Out-of-range octet fails the 0-255 sanity check.
+        self.assertFalse(cdi._has_full_ip("999-1-2-3-foo.com"))
+        # Pure domain, no IP.
+        self.assertFalse(cdi._has_full_ip("example.com"))
+
+    def test_find_unknown_detects_full_ips(self):
+        import parsedmarc.resources.maps.find_unknown_base_reverse_dns as fu
+
+        self.assertTrue(fu._has_full_ip("170-254-144-204-nobreinternet.com.br"))
+        self.assertFalse(fu._has_full_ip("ip-147-135-108.us"))
+        self.assertFalse(fu._has_full_ip("cprapid.com"))
+
+    def test_apply_psl_override_dot_prefix(self):
+        import parsedmarc.resources.maps.collect_domain_info as cdi
+
+        ov = [".cprapid.com", ".linode.com"]
+        self.assertEqual(cdi._apply_psl_override("foo.cprapid.com", ov), "cprapid.com")
+        self.assertEqual(cdi._apply_psl_override("a.b.linode.com", ov), "linode.com")
+
+    def test_apply_psl_override_dash_prefix(self):
+        import parsedmarc.resources.maps.collect_domain_info as cdi
+
+        ov = ["-nobre.com.br"]
+        self.assertEqual(
+            cdi._apply_psl_override("1-2-3-4-nobre.com.br", ov), "nobre.com.br"
+        )
+
+    def test_apply_psl_override_no_match(self):
+        import parsedmarc.resources.maps.collect_domain_info as cdi
+
+        ov = [".cprapid.com"]
+        self.assertEqual(cdi._apply_psl_override("example.com", ov), "example.com")
+
+
+class TestDetectPSLOverrides(unittest.TestCase):
+    """Cluster detection, brand-tail extraction, and full-pipeline behaviour
+    for `detect_psl_overrides.py`."""
+
+    def setUp(self):
+        import parsedmarc.resources.maps.detect_psl_overrides as dpo
+
+        self.dpo = dpo
+
+    def test_extract_brand_tail_dot_separator(self):
+        self.assertEqual(
+            self.dpo.extract_brand_tail("74-208-244-234.cprapid.com"),
+            ".cprapid.com",
+        )
+
+    def test_extract_brand_tail_dash_separator(self):
+        self.assertEqual(
+            self.dpo.extract_brand_tail("170-254-144-204-nobre.com.br"),
+            "-nobre.com.br",
+        )
+
+    def test_extract_brand_tail_no_separator(self):
+        self.assertEqual(
+            self.dpo.extract_brand_tail("host134-254-143-190tigobusiness.com.ni"),
+            "tigobusiness.com.ni",
+        )
+
+    def test_extract_brand_tail_no_ip_returns_none(self):
+        self.assertIsNone(self.dpo.extract_brand_tail("plain.example.com"))
+
+    def test_extract_brand_tail_rejects_short_tail(self):
+        """A tail shorter than MIN_TAIL_LEN is rejected to avoid folding to `.com`."""
+        # Four-octet IP followed by only `.br` (2 chars after the dot) — too short.
+        self.assertIsNone(self.dpo.extract_brand_tail("1-2-3-4.br"))
+
+    def test_detect_clusters_meets_threshold(self):
+        domains = [
+            "1-2-3-4.cprapid.com",
+            "5-6-7-8.cprapid.com",
+            "9-10-11-12.cprapid.com",
+            "1-2-3-4-other.com.br",  # not enough of these
+        ]
+        clusters = self.dpo.detect_clusters(domains, threshold=3, known_overrides=set())
+        self.assertIn(".cprapid.com", clusters)
+        self.assertEqual(len(clusters[".cprapid.com"]), 3)
+        self.assertNotIn("-other.com.br", clusters)
+
+    def test_detect_clusters_honours_threshold(self):
+        domains = [
+            "1-2-3-4.cprapid.com",
+            "5-6-7-8.cprapid.com",
+        ]
+        clusters = self.dpo.detect_clusters(domains, threshold=3, known_overrides=set())
+        self.assertEqual(clusters, {})
+
+    def test_detect_clusters_skips_known_overrides(self):
+        """Tails already in psl_overrides.txt must not be re-proposed."""
+        domains = [
+            "1-2-3-4.cprapid.com",
+            "5-6-7-8.cprapid.com",
+            "9-10-11-12.cprapid.com",
+        ]
+        clusters = self.dpo.detect_clusters(
+            domains, threshold=3, known_overrides={".cprapid.com"}
+        )
+        self.assertNotIn(".cprapid.com", clusters)
+
+    def test_apply_override_matches_first(self):
+        """apply_override iterates in list order and returns on the first match."""
+        ov = [".cprapid.com", "-nobre.com.br"]
+        self.assertEqual(
+            self.dpo.apply_override("1-2-3-4.cprapid.com", ov), "cprapid.com"
+        )
+        self.assertEqual(
+            self.dpo.apply_override("1-2-3-4-nobre.com.br", ov), "nobre.com.br"
+        )
+        self.assertEqual(self.dpo.apply_override("unrelated.com", ov), "unrelated.com")
+
+    def test_has_full_ip_shared_with_other_scripts(self):
+        """The detect script's IP check must agree with the other map scripts."""
+        self.assertTrue(self.dpo.has_full_ip("74-208-244-234.cprapid.com"))
+        self.assertFalse(self.dpo.has_full_ip("ip-147-135-108.us"))
+        self.assertFalse(self.dpo.has_full_ip("example.com"))
+
+
+if __name__ == "__main__":
+    unittest.main(verbosity=2)
diff --git a/tests/test_s3.py b/tests/test_s3.py
new file mode 100644
index 0000000..8010525
--- /dev/null
+++ b/tests/test_s3.py
@@ -0,0 +1,23 @@
+"""Tests for parsedmarc.s3"""
+
+import unittest
+
+
+class Test(unittest.TestCase):
+    """Kitchen-sink tests redistributed from the original
+    tests.py monolith. Future PRs should split these further
+    into purpose-specific TestCase subclasses as natural
+    groupings emerge."""
+
+    def testS3BackwardCompatAlias(self):
+        """S3Client forensic alias points to failure method"""
+        from parsedmarc.s3 import S3Client
+
+        self.assertIs(
+            S3Client.save_forensic_report_to_s3,  # type: ignore[attr-defined]
+            S3Client.save_failure_report_to_s3,
+        )
+
+
+if __name__ == "__main__":
+    unittest.main(verbosity=2)
diff --git a/tests/test_splunk.py b/tests/test_splunk.py
new file mode 100644
index 0000000..e098a07
--- /dev/null
+++ b/tests/test_splunk.py
@@ -0,0 +1,49 @@
+"""Tests for parsedmarc.splunk"""
+
+import unittest
+
+
+class Test(unittest.TestCase):
+    """Kitchen-sink tests redistributed from the original
+    tests.py monolith. Future PRs should split these further
+    into purpose-specific TestCase subclasses as natural
+    groupings emerge."""
+
+    def testSplunkHECClientInit(self):
+        """HECClient initializes with correct URL and headers"""
+        from parsedmarc.splunk import HECClient
+
+        client = HECClient(
+            url="https://splunk.example.com:8088",
+            access_token="my-token",
+            index="main",
+        )
+        self.assertIn("/services/collector/event/1.0", client.url)
+        self.assertEqual(client.access_token, "my-token")
+        self.assertEqual(client.index, "main")
+        self.assertEqual(client.source, "parsedmarc")
+        self.assertIn("Splunk my-token", client.session.headers["Authorization"])
+
+    def testSplunkHECClientStripTokenPrefix(self):
+        """HECClient strips 'Splunk ' prefix from token"""
+        from parsedmarc.splunk import HECClient
+
+        client = HECClient(
+            url="https://splunk.example.com",
+            access_token="Splunk my-token",
+            index="main",
+        )
+        self.assertEqual(client.access_token, "my-token")
+
+    def testSplunkBackwardCompatAlias(self):
+        """HECClient forensic alias points to failure method"""
+        from parsedmarc.splunk import HECClient
+
+        self.assertIs(
+            HECClient.save_forensic_reports_to_splunk,  # type: ignore[attr-defined]
+            HECClient.save_failure_reports_to_splunk,
+        )
+
+
+if __name__ == "__main__":
+    unittest.main(verbosity=2)
diff --git a/tests/test_syslog.py b/tests/test_syslog.py
new file mode 100644
index 0000000..696d782
--- /dev/null
+++ b/tests/test_syslog.py
@@ -0,0 +1,39 @@
+"""Tests for parsedmarc.syslog"""
+
+import unittest
+
+
+class Test(unittest.TestCase):
+    """Kitchen-sink tests redistributed from the original
+    tests.py monolith. Future PRs should split these further
+    into purpose-specific TestCase subclasses as natural
+    groupings emerge."""
+
+    def testSyslogClientUdpInit(self):
+        """SyslogClient creates UDP handler"""
+        from parsedmarc.syslog import SyslogClient
+
+        client = SyslogClient("localhost", 514, protocol="udp")
+        self.assertEqual(client.server_name, "localhost")
+        self.assertEqual(client.server_port, 514)
+        self.assertEqual(client.protocol, "udp")
+
+    def testSyslogClientInvalidProtocol(self):
+        """SyslogClient with invalid protocol raises ValueError"""
+        from parsedmarc.syslog import SyslogClient
+
+        with self.assertRaises(ValueError):
+            SyslogClient("localhost", 514, protocol="invalid")
+
+    def testSyslogBackwardCompatAlias(self):
+        """SyslogClient forensic alias points to failure method"""
+        from parsedmarc.syslog import SyslogClient
+
+        self.assertIs(
+            SyslogClient.save_forensic_report_to_syslog,  # type: ignore[attr-defined]
+            SyslogClient.save_failure_report_to_syslog,
+        )
+
+
+if __name__ == "__main__":
+    unittest.main(verbosity=2)
diff --git a/tests/test_utils.py b/tests/test_utils.py
new file mode 100644
index 0000000..8a889ab
--- /dev/null
+++ b/tests/test_utils.py
@@ -0,0 +1,722 @@
+"""Tests for parsedmarc.utils"""
+
+import os
+import tempfile
+import unittest
+from datetime import datetime, timezone
+from tempfile import NamedTemporaryFile
+from unittest.mock import MagicMock, patch
+
+import dns.exception
+import requests
+from expiringdict import ExpiringDict
+
+import parsedmarc
+import parsedmarc.utils
+
+
+class Test(unittest.TestCase):
+    """Kitchen-sink tests redistributed from the original
+    tests.py monolith. Future PRs should split these further
+    into purpose-specific TestCase subclasses as natural
+    groupings emerge."""
+
+    def testBase64Decoding(self):
+        """Test base64 decoding"""
+        # Example from Wikipedia Base64 article
+        b64_str = "YW55IGNhcm5hbCBwbGVhcw"
+        decoded_str = parsedmarc.utils.decode_base64(b64_str)
+        self.assertEqual(decoded_str, b"any carnal pleas")
+
+    def testPSLDownload(self):
+        """Test Public Suffix List domain lookups"""
+        subdomain = "foo.example.com"
+        result = parsedmarc.utils.get_base_domain(subdomain)
+        self.assertEqual(result, "example.com")
+
+        # psl_overrides.txt intentionally folds CDN-customer PTRs so every
+        # sender on the same network clusters under one display key.
+        # ``.akamaiedge.net`` is an override, so its subdomains collapse to
+        # ``akamaiedge.net`` even though the live PSL carries the finer-grained
+        # ``c.akamaiedge.net`` — the override is the design decision.
+        subdomain = "e3191.c.akamaiedge.net"
+        result = parsedmarc.utils.get_base_domain(subdomain)
+        self.assertEqual(result, "akamaiedge.net")
+
+    def testIpAddressInfoSurfacesASNFields(self):
+        """ASN number, name, and domain from the bundled MMDB appear on every
+        IP info result, even when no PTR resolves."""
+        info = parsedmarc.utils.get_ip_address_info("8.8.8.8", offline=True)
+        self.assertEqual(info["asn"], 15169)
+        self.assertIsInstance(info["asn"], int)
+        self.assertEqual(info["as_domain"], "google.com")
+        self.assertTrue(info["as_name"])
+
+    def testIpAddressInfoFallsBackToASNMapEntryWhenNoPTR(self):
+        """When reverse DNS is absent, the ASN domain should be used as a
+        lookup into the reverse_dns_map so the row still gets attributed,
+        while reverse_dns and base_domain remain null."""
+        info = parsedmarc.utils.get_ip_address_info("8.8.8.8", offline=True)
+        self.assertIsNone(info["reverse_dns"])
+        self.assertIsNone(info["base_domain"])
+        self.assertEqual(info["name"], "Google (Including Gmail and Google Workspace)")
+        self.assertEqual(info["type"], "Email Provider")
+
+    def testIpAddressInfoFallsBackToRawASNameOnMapMiss(self):
+        """When neither PTR nor an ASN-map entry resolves, the raw AS name
+        is used as source_name with type left null — better than leaving
+        the row unattributed."""
+        # Patch the DB record with an as_domain that is not in the map
+        # (a reserved .example name), so the as_name fallback branch is
+        # exercised without depending on the current map state.
+        from unittest.mock import patch
+
+        with patch(
+            "parsedmarc.utils.get_ip_address_db_record",
+            return_value={
+                "country": "US",
+                "asn": 64496,
+                "as_name": "Some Unmapped Org, Inc.",
+                "as_domain": "unmapped-for-this-test.example",
+            },
+        ):
+            # Bypass cache to avoid prior-test pollution.
+            info = parsedmarc.utils.get_ip_address_info(
+                "192.0.2.1", offline=True, cache=None
+            )
+        self.assertIsNone(info["reverse_dns"])
+        self.assertIsNone(info["base_domain"])
+        self.assertIsNone(info["type"])
+        self.assertEqual(info["name"], "Some Unmapped Org, Inc.")
+        self.assertEqual(info["as_domain"], "unmapped-for-this-test.example")
+
+    def testWeakFallbackAttributionIsNotCached(self):
+        """A transient PTR lookup failure that lands on the raw-as_name
+        fallback must not poison the cache. ``get_reverse_dns()`` swallows
+        every DNSException as ``None``, so a timeout looks identical to a
+        real no-PTR case — if we cached the weak attribution, the 4-hour
+        TTL would lock in a misattribution even after the PTR returns.
+
+        PTR-backed matches and ASN-domain matches are stable attributions
+        and must still be cached, so we only skip the specific
+        ``reverse_dns=None AND type=None AND name=as_name`` state."""
+        from unittest.mock import patch
+        from expiringdict import ExpiringDict
+
+        cache = ExpiringDict(max_len=100, max_age_seconds=14400)
+
+        # Scenario 1: weak fallback (no PTR, unmapped as_domain, raw as_name
+        # used). Must NOT be cached.
+        with patch(
+            "parsedmarc.utils.get_ip_address_db_record",
+            return_value={
+                "country": "US",
+                "asn": 64496,
+                "as_name": "Some Unmapped Org, Inc.",
+                "as_domain": "unmapped-for-this-test.example",
+            },
+        ):
+            parsedmarc.utils.get_ip_address_info("192.0.2.1", offline=True, cache=cache)
+        self.assertNotIn("192.0.2.1", cache)
+
+        # Scenario 2: ASN-domain match (no PTR, as_domain IS in the map).
+        # Stable attribution — must still be cached.
+        with patch(
+            "parsedmarc.utils.get_ip_address_db_record",
+            return_value={
+                "country": "US",
+                "asn": 15169,
+                "as_name": "Google LLC",
+                "as_domain": "google.com",
+            },
+        ):
+            parsedmarc.utils.get_ip_address_info("192.0.2.2", offline=True, cache=cache)
+        self.assertIn("192.0.2.2", cache)
+
+    def testIPinfoAPIPrimarySourceAndInvalidKeyIsFatal(self):
+        """With an API token configured, lookups hit the API first via the
+        documented ?token= query param. A 401/403 response propagates as
+        ``InvalidIPinfoAPIKey`` so the CLI can exit fatally. Any other
+        non-2xx or network error falls through to the MMDB silently.
+
+        The IPinfo Lite API is documented as having no request limit, so
+        there is no rate-limit/quota handling to test — only the fatal path
+        on invalid tokens and the success path."""
+        from unittest.mock import patch, MagicMock
+
+        from parsedmarc.utils import (
+            InvalidIPinfoAPIKey,
+            configure_ipinfo_api,
+            get_ip_address_db_record,
+        )
+
+        def _mock_response(status_code, json_body=None):
+            resp = MagicMock()
+            resp.status_code = status_code
+            resp.ok = 200 <= status_code < 300
+            resp.json.return_value = json_body or {}
+            return resp
+
+        try:
+            # Success: API returns IPinfo-schema JSON; record comes from API.
+            api_json = {
+                "ip": "8.8.8.8",
+                "asn": "AS15169",
+                "as_name": "Google LLC",
+                "as_domain": "google.com",
+                "country_code": "US",
+            }
+            with patch(
+                "parsedmarc.utils.requests.get",
+                return_value=_mock_response(200, api_json),
+            ) as mock_get:
+                configure_ipinfo_api("fake-token", probe=False)
+                record = get_ip_address_db_record("8.8.8.8")
+            self.assertEqual(record["country"], "US")
+            self.assertEqual(record["asn"], 15169)
+            self.assertEqual(record["as_domain"], "google.com")
+            # Auth must use the documented query param, not a Bearer header.
+            _, kwargs = mock_get.call_args
+            self.assertEqual(kwargs["params"], {"token": "fake-token"})
+            self.assertNotIn("Authorization", kwargs["headers"])
+
+            # Invalid key: 401 raises a fatal exception even on a random lookup.
+            with patch(
+                "parsedmarc.utils.requests.get",
+                return_value=_mock_response(401),
+            ):
+                configure_ipinfo_api("bad-token", probe=False)
+                with self.assertRaises(InvalidIPinfoAPIKey):
+                    get_ip_address_db_record("8.8.8.8")
+
+            # Any other non-2xx (e.g. 500, 503) falls back to the MMDB silently.
+            configure_ipinfo_api("fake-token", probe=False)
+            with patch(
+                "parsedmarc.utils.requests.get",
+                return_value=_mock_response(500),
+            ):
+                record = get_ip_address_db_record("8.8.8.8")
+            # MMDB fallback fills in Google's ASN from the bundled MMDB.
+            self.assertEqual(record["asn"], 15169)
+        finally:
+            configure_ipinfo_api(None)
+
+    def testTimestampToDatetime(self):
+        """timestamp_to_datetime converts UNIX timestamp to datetime"""
+        from datetime import datetime
+
+        ts = 1704067200
+        dt = parsedmarc.utils.timestamp_to_datetime(ts)
+        self.assertIsInstance(dt, datetime)
+        # Should match stdlib fromtimestamp (local time)
+        self.assertEqual(dt, datetime.fromtimestamp(ts))
+
+    def testTimestampToHuman(self):
+        """timestamp_to_human returns formatted string"""
+        result = parsedmarc.utils.timestamp_to_human(1704067200)
+        self.assertRegex(result, r"\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}")
+
+    def testHumanTimestampToDatetime(self):
+        """human_timestamp_to_datetime parses timestamp string"""
+        dt = parsedmarc.utils.human_timestamp_to_datetime("2024-01-01 00:00:00")
+        self.assertIsInstance(dt, datetime)
+        self.assertEqual(dt.year, 2024)
+        self.assertEqual(dt.month, 1)
+        self.assertEqual(dt.day, 1)
+
+    def testHumanTimestampToDatetimeUtc(self):
+        """human_timestamp_to_datetime with to_utc=True returns UTC"""
+        dt = parsedmarc.utils.human_timestamp_to_datetime(
+            "2024-01-01 12:00:00", to_utc=True
+        )
+        self.assertEqual(dt.tzinfo, timezone.utc)
+
+    def testHumanTimestampToDatetimeParenthesisStripping(self):
+        """Parenthesized content is stripped from timestamps"""
+        dt = parsedmarc.utils.human_timestamp_to_datetime(
+            "Mon, 01 Jan 2024 00:00:00 +0000 (UTC)"
+        )
+        self.assertEqual(dt.year, 2024)
+
+    def testHumanTimestampToDatetimeNegativeZero(self):
+        """-0000 timezone is handled"""
+        dt = parsedmarc.utils.human_timestamp_to_datetime("2024-01-01 00:00:00 -0000")
+        self.assertEqual(dt.year, 2024)
+
+    def testHumanTimestampToUnixTimestamp(self):
+        """human_timestamp_to_unix_timestamp converts to int"""
+        ts = parsedmarc.utils.human_timestamp_to_unix_timestamp("2024-01-01 00:00:00")
+        self.assertIsInstance(ts, int)
+
+    def testHumanTimestampToUnixTimestampWithT(self):
+        """T separator in timestamp is handled"""
+        ts = parsedmarc.utils.human_timestamp_to_unix_timestamp("2024-01-01T00:00:00")
+        self.assertIsInstance(ts, int)
+
+    def testGetIpAddressCountry(self):
+        """get_ip_address_country returns country code using bundled DBIP"""
+        # 8.8.8.8 is a well-known Google DNS IP located in the US
+        country = parsedmarc.utils.get_ip_address_country("8.8.8.8")
+        self.assertEqual(country, "US")
+
+    def testGetIpAddressCountryNotFound(self):
+        """get_ip_address_country returns None for reserved IP"""
+        country = parsedmarc.utils.get_ip_address_country("127.0.0.1")
+        self.assertIsNone(country)
+
+    def testGetServiceFromReverseDnsBaseDomainOffline(self):
+        """get_service_from_reverse_dns_base_domain in offline mode"""
+        result = parsedmarc.utils.get_service_from_reverse_dns_base_domain(
+            "google.com", offline=True
+        )
+        self.assertIn("Google", result["name"])
+        self.assertIsNotNone(result["type"])
+
+    def testGetServiceFromReverseDnsBaseDomainUnknown(self):
+        """Unknown base domain returns domain as name and None as type"""
+        result = parsedmarc.utils.get_service_from_reverse_dns_base_domain(
+            "unknown-domain-xyz.example", offline=True
+        )
+        self.assertEqual(result["name"], "unknown-domain-xyz.example")
+        self.assertIsNone(result["type"])
+
+    def testGetIpAddressInfoOffline(self):
+        """get_ip_address_info in offline mode returns country but no DNS"""
+        info = parsedmarc.utils.get_ip_address_info("8.8.8.8", offline=True)
+        self.assertEqual(info["ip_address"], "8.8.8.8")
+        self.assertEqual(info["country"], "US")
+        self.assertIsNone(info["reverse_dns"])
+
+    def testGetIpAddressInfoCache(self):
+        """get_ip_address_info uses cache on second call"""
+        from expiringdict import ExpiringDict
+
+        cache = ExpiringDict(max_len=100, max_age_seconds=60)
+        with patch("parsedmarc.utils.get_reverse_dns", return_value="dns.google"):
+            info1 = parsedmarc.utils.get_ip_address_info(
+                "8.8.8.8",
+                offline=False,
+                cache=cache,
+                always_use_local_files=True,
+            )
+        self.assertIn("8.8.8.8", cache)
+        info2 = parsedmarc.utils.get_ip_address_info(
+            "8.8.8.8", offline=False, cache=cache
+        )
+        self.assertEqual(info1["ip_address"], info2["ip_address"])
+        self.assertEqual(info2["reverse_dns"], "dns.google")
+
+    def testParseEmailAddressWithDisplayName(self):
+        """parse_email_address with display name"""
+        result = parsedmarc.utils.parse_email_address(("John Doe", "john@example.com"))  # type: ignore[arg-type]
+        self.assertEqual(result["display_name"], "John Doe")
+        self.assertEqual(result["address"], "john@example.com")
+        self.assertEqual(result["local"], "john")
+        self.assertEqual(result["domain"], "example.com")
+
+    def testParseEmailAddressWithoutDisplayName(self):
+        """parse_email_address with empty display name"""
+        result = parsedmarc.utils.parse_email_address(("", "john@example.com"))  # type: ignore[arg-type]
+        self.assertIsNone(result["display_name"])
+        self.assertEqual(result["address"], "john@example.com")
+
+    def testParseEmailAddressNoAt(self):
+        """parse_email_address with no @ returns None local/domain"""
+        result = parsedmarc.utils.parse_email_address(("", "localonly"))  # type: ignore[arg-type]
+        self.assertIsNone(result["local"])
+        self.assertIsNone(result["domain"])
+
+    def testGetFilenameSafeString(self):
+        """get_filename_safe_string removes invalid chars"""
+        result = parsedmarc.utils.get_filename_safe_string('file/name:with"bad*chars')
+        self.assertNotIn("/", result)
+        self.assertNotIn(":", result)
+        self.assertNotIn('"', result)
+        self.assertNotIn("*", result)
+
+    def testGetFilenameSafeStringNone(self):
+        """get_filename_safe_string with None returns 'None'"""
+        result = parsedmarc.utils.get_filename_safe_string(None)  # type: ignore[arg-type]
+        self.assertEqual(result, "None")
+
+    def testGetFilenameSafeStringLong(self):
+        """get_filename_safe_string truncates to 100 chars"""
+        result = parsedmarc.utils.get_filename_safe_string("a" * 200)
+        self.assertEqual(len(result), 100)
+
+    def testGetFilenameSafeStringTrailingDot(self):
+        """get_filename_safe_string strips trailing dots"""
+        result = parsedmarc.utils.get_filename_safe_string("filename...")
+        self.assertFalse(result.endswith("."))
+
+    def testIsMboxNonMbox(self):
+        """is_mbox returns False for non-mbox file"""
+        result = parsedmarc.utils.is_mbox("samples/empty.xml")
+        self.assertFalse(result)
+
+    def testIsOutlookMsgNonMsg(self):
+        """is_outlook_msg returns False for non-MSG content"""
+        self.assertFalse(parsedmarc.utils.is_outlook_msg(b"not an outlook msg"))
+        self.assertFalse(parsedmarc.utils.is_outlook_msg("string content"))
+
+    def testIsOutlookMsgMagic(self):
+        """is_outlook_msg returns True for correct magic bytes"""
+        magic = b"\xd0\xcf\x11\xe0\xa1\xb1\x1a\xe1" + b"\x00" * 100
+        self.assertTrue(parsedmarc.utils.is_outlook_msg(magic))
+
+
+class TestLoadPSLOverrides(unittest.TestCase):
+    """Covers `parsedmarc.utils.load_psl_overrides`."""
+
+    def setUp(self):
+        # Snapshot the module-level list so each test leaves it as it found it.
+        self._saved = list(parsedmarc.utils.psl_overrides)
+
+    def tearDown(self):
+        parsedmarc.utils.psl_overrides.clear()
+        parsedmarc.utils.psl_overrides.extend(self._saved)
+
+    def test_offline_loads_bundled_file(self):
+        """offline=True populates the list from the bundled file, no network."""
+        result = parsedmarc.utils.load_psl_overrides(offline=True)
+        self.assertIs(result, parsedmarc.utils.psl_overrides)
+        self.assertGreater(len(result), 0)
+        # The bundled file is expected to contain at least one well-known entry.
+        self.assertIn(".linode.com", result)
+
+    def test_local_file_path_overrides_bundled(self):
+        """A custom local_file_path takes precedence over the bundled copy."""
+        with tempfile.NamedTemporaryFile(
+            "w", suffix=".txt", delete=False, encoding="utf-8"
+        ) as tf:
+            tf.write("-custom-brand.com\n.another-brand.net\n\n   \n")
+            path = tf.name
+        try:
+            result = parsedmarc.utils.load_psl_overrides(
+                offline=True, local_file_path=path
+            )
+            self.assertEqual(result, ["-custom-brand.com", ".another-brand.net"])
+        finally:
+            os.unlink(path)
+
+    def test_clear_before_reload(self):
+        """Re-running load_psl_overrides replaces the list, not appends."""
+        parsedmarc.utils.psl_overrides.clear()
+        parsedmarc.utils.psl_overrides.append(".stale-entry.com")
+        parsedmarc.utils.load_psl_overrides(offline=True)
+        self.assertNotIn(".stale-entry.com", parsedmarc.utils.psl_overrides)
+
+    def test_url_success(self):
+        """A 200 response from the URL populates the list."""
+        fake_body = "-fetched-brand.com\n.cdn-fetched.net\n"
+        mock_response = MagicMock()
+        mock_response.text = fake_body
+        mock_response.raise_for_status = MagicMock()
+        with patch(
+            "parsedmarc.utils.requests.get", return_value=mock_response
+        ) as mock_get:
+            result = parsedmarc.utils.load_psl_overrides(url="https://example.test/ov")
+            self.assertEqual(result, ["-fetched-brand.com", ".cdn-fetched.net"])
+            mock_get.assert_called_once()
+
+    def test_url_failure_falls_back_to_local(self):
+        """A network error falls back to the bundled copy."""
+        import requests
+
+        with patch(
+            "parsedmarc.utils.requests.get",
+            side_effect=requests.exceptions.ConnectionError("nope"),
+        ):
+            result = parsedmarc.utils.load_psl_overrides(url="https://example.test/ov")
+        # Bundled file still loaded.
+        self.assertGreater(len(result), 0)
+        self.assertIn(".linode.com", result)
+
+    def test_always_use_local_skips_network(self):
+        """always_use_local_file=True must not call requests.get."""
+        with patch("parsedmarc.utils.requests.get") as mock_get:
+            parsedmarc.utils.load_psl_overrides(always_use_local_file=True)
+            mock_get.assert_not_called()
+
+
+class TestLoadReverseDnsMapReloadsPSLOverrides(unittest.TestCase):
+    """`load_reverse_dns_map` must reload `psl_overrides.txt` in the same call
+    so map entries that depend on folded bases resolve correctly."""
+
+    def setUp(self):
+        self._saved = list(parsedmarc.utils.psl_overrides)
+
+    def tearDown(self):
+        parsedmarc.utils.psl_overrides.clear()
+        parsedmarc.utils.psl_overrides.extend(self._saved)
+
+    def test_map_load_triggers_psl_reload(self):
+        """Calling load_reverse_dns_map offline also invokes load_psl_overrides
+        with matching flags, and the overrides list is repopulated."""
+        rdm = {}
+        parsedmarc.utils.psl_overrides.clear()
+        parsedmarc.utils.psl_overrides.append(".stale-from-before.com")
+        with patch(
+            "parsedmarc.utils.load_psl_overrides",
+            wraps=parsedmarc.utils.load_psl_overrides,
+        ) as spy:
+            parsedmarc.utils.load_reverse_dns_map(rdm, offline=True)
+        spy.assert_called_once()
+        kwargs = spy.call_args.kwargs
+        self.assertTrue(kwargs["offline"])
+        self.assertIsNone(kwargs["url"])
+        self.assertIsNone(kwargs["local_file_path"])
+        self.assertNotIn(".stale-from-before.com", parsedmarc.utils.psl_overrides)
+
+    def test_map_load_forwards_psl_overrides_kwargs(self):
+        """psl_overrides_path / psl_overrides_url are forwarded verbatim."""
+        rdm = {}
+        with patch("parsedmarc.utils.load_psl_overrides") as spy:
+            parsedmarc.utils.load_reverse_dns_map(
+                rdm,
+                offline=True,
+                always_use_local_file=True,
+                psl_overrides_path="/tmp/custom.txt",
+                psl_overrides_url="https://example.test/ov",
+            )
+        spy.assert_called_once_with(
+            always_use_local_file=True,
+            local_file_path="/tmp/custom.txt",
+            url="https://example.test/ov",
+            offline=True,
+        )
+
+
+class TestGetBaseDomainWithOverrides(unittest.TestCase):
+    """`get_base_domain` must honour the current psl_overrides list."""
+
+    def setUp(self):
+        self._saved = list(parsedmarc.utils.psl_overrides)
+        parsedmarc.utils.psl_overrides.clear()
+        parsedmarc.utils.psl_overrides.extend([".cprapid.com", "-nobre.com.br"])
+
+    def tearDown(self):
+        parsedmarc.utils.psl_overrides.clear()
+        parsedmarc.utils.psl_overrides.extend(self._saved)
+
+    def test_dot_prefixed_override_folds_subdomain(self):
+        result = parsedmarc.utils.get_base_domain("74-208-244-234.cprapid.com")
+        self.assertEqual(result, "cprapid.com")
+
+    def test_dash_prefixed_override_folds_subdomain(self):
+        result = parsedmarc.utils.get_base_domain("host-1-2-3-4-nobre.com.br")
+        self.assertEqual(result, "nobre.com.br")
+
+    def test_unmatched_domain_falls_through_to_psl(self):
+        result = parsedmarc.utils.get_base_domain("sub.example.com")
+        self.assertEqual(result, "example.com")
+
+
+class TestUtilsDnsCaching(unittest.TestCase):
+    """Tests for DNS query caching and reverse DNS error handling"""
+
+    def testQueryDnsUsesCacheHit(self):
+        """query_dns returns cached result without making DNS query"""
+        cache = ExpiringDict(max_len=100, max_age_seconds=60)
+        cache["example.com_A"] = ["1.2.3.4"]
+        result = parsedmarc.utils.query_dns("example.com", "A", cache=cache)
+        self.assertEqual(result, ["1.2.3.4"])
+
+    def testQueryDnsCachesResult(self):
+        """query_dns stores result in cache when cache is non-empty"""
+        cache = ExpiringDict(max_len=100, max_age_seconds=60)
+        # Pre-populate so ExpiringDict is truthy
+        cache["seed_key"] = ["seed"]
+        mock_record = MagicMock()
+        mock_record.to_text.return_value = '"1.2.3.4"'
+        mock_resolver = MagicMock()
+        mock_resolver.resolve.return_value = [mock_record]
+        with patch(
+            "parsedmarc.utils.dns.resolver.Resolver", return_value=mock_resolver
+        ):
+            result = parsedmarc.utils.query_dns(
+                "test-cache.example.com", "A", cache=cache
+            )
+            self.assertEqual(result, ["1.2.3.4"])
+            self.assertIn("test-cache.example.com_A", cache)
+
+    def testReverseDnsReturnsNoneOnFailure(self):
+        """get_reverse_dns returns None on DNS exceptions"""
+        with patch(
+            "parsedmarc.utils.query_dns",
+            side_effect=dns.exception.DNSException("timeout"),
+        ):
+            result = parsedmarc.utils.get_reverse_dns("203.0.113.1")
+            self.assertIsNone(result)
+
+
+class TestUtilsIpDbPaths(unittest.TestCase):
+    """Tests for IP database path validation"""
+
+    def testCustomPathFallsBack(self):
+        """Non-existent custom db path falls back to default"""
+        result = parsedmarc.utils.get_ip_address_country(
+            "1.1.1.1", db_path="/nonexistent/path.mmdb"
+        )
+        self.assertTrue(result is None or isinstance(result, str))
+
+    def testBundledDbWorks(self):
+        """Bundled IP database returns results"""
+        result = parsedmarc.utils.get_ip_address_country("8.8.8.8")
+        self.assertEqual(result, "US")
+
+
+class TestUtilsParseEmail(unittest.TestCase):
+    """Tests for parse_email edge cases"""
+
+    def testMinimalEmail(self):
+        """parse_email handles email with minimal headers"""
+        email_str = """From: test@example.com
+Subject: Test
+
+Body text"""
+        result = parsedmarc.utils.parse_email(email_str)
+        self.assertEqual(result["subject"], "Test")
+        self.assertEqual(result["reply_to"], [])
+
+    def testEmailWithNoSubject(self):
+        """parse_email defaults subject to None when missing"""
+        email_str = """From: test@example.com
+To: other@example.com
+
+Body"""
+        result = parsedmarc.utils.parse_email(email_str)
+        self.assertIsNone(result["subject"])
+
+    def testEmailBytesInput(self):
+        """parse_email handles bytes input"""
+        email_bytes = b"""From: test@example.com
+Subject: Bytes Test
+To: other@example.com
+
+Body"""
+        result = parsedmarc.utils.parse_email(email_bytes)
+        self.assertEqual(result["subject"], "Bytes Test")
+
+    def testEmailWithAttachments(self):
+        """parse_email with strip_attachment_payloads removes payloads"""
+        from email.mime.multipart import MIMEMultipart
+        from email.mime.text import MIMEText
+        from email.mime.base import MIMEBase
+        from email import encoders
+
+        msg = MIMEMultipart()
+        msg["From"] = "test@example.com"
+        msg["To"] = "other@example.com"
+        msg["Subject"] = "Attachment Test"
+        msg.attach(MIMEText("Body text"))
+
+        attachment = MIMEBase("application", "octet-stream")
+        attachment.set_payload(b"file content here")
+        encoders.encode_base64(attachment)
+        attachment.add_header("Content-Disposition", "attachment", filename="test.bin")
+        msg.attach(attachment)
+
+        result = parsedmarc.utils.parse_email(
+            msg.as_string(), strip_attachment_payloads=True
+        )
+        for att in result["attachments"]:
+            self.assertNotIn("payload", att)
+
+
+class TestUtilsOutlookMsg(unittest.TestCase):
+    """Tests for Outlook MSG detection and conversion"""
+
+    def testIsOutlookMsg(self):
+        """is_outlook_msg detects MSG magic bytes"""
+        msg_magic = b"\xd0\xcf\x11\xe0\xa1\xb1\x1a\xe1" + b"\x00" * 100
+        self.assertTrue(parsedmarc.utils.is_outlook_msg(msg_magic))
+
+    def testIsNotOutlookMsg(self):
+        """is_outlook_msg rejects non-MSG content"""
+        self.assertFalse(parsedmarc.utils.is_outlook_msg(b"not an msg file"))
+        self.assertFalse(parsedmarc.utils.is_outlook_msg("string input"))
+
+    def testConvertOutlookMsgInvalidInput(self):
+        """convert_outlook_msg raises ValueError for non-MSG bytes"""
+        with self.assertRaises(ValueError):
+            parsedmarc.utils.convert_outlook_msg(b"not an msg file")
+
+
+class TestUtilsReverseDnsMap(unittest.TestCase):
+    """Tests for reverse DNS map loading"""
+
+    def testLoadReverseDnsMapOffline(self):
+        """load_reverse_dns_map in offline mode loads bundled map"""
+        rdns_map = {}
+        parsedmarc.utils.load_reverse_dns_map(rdns_map, offline=True)
+        self.assertTrue(len(rdns_map) > 0)
+
+    def testLoadReverseDnsMapLocalOverride(self):
+        """load_reverse_dns_map uses local_file_path when provided"""
+        with NamedTemporaryFile("w", suffix=".csv", delete=False) as f:
+            f.write("base_reverse_dns,name,type\n")
+            f.write("custom.example.com,Custom Service,hosting\n")
+            path = f.name
+        try:
+            rdns_map = {}
+            parsedmarc.utils.load_reverse_dns_map(
+                rdns_map, offline=True, local_file_path=path
+            )
+            self.assertIn("custom.example.com", rdns_map)
+            self.assertEqual(rdns_map["custom.example.com"]["name"], "Custom Service")
+        finally:
+            os.remove(path)
+
+    def testLoadReverseDnsMapNetworkFailureFallback(self):
+        """load_reverse_dns_map falls back to bundled on network error"""
+        rdns_map = {}
+        with patch(
+            "parsedmarc.utils.requests.get",
+            side_effect=requests.exceptions.ConnectionError("no network"),
+        ):
+            parsedmarc.utils.load_reverse_dns_map(rdns_map)
+        self.assertTrue(len(rdns_map) > 0)
+
+
+class TestPslOverrides(unittest.TestCase):
+    """Tests for PSL override matching"""
+
+    def testOverrideMatch(self):
+        """get_base_domain falls through to the PSL when no override matches"""
+        # psl_overrides contains entries; check that get_base_domain
+        # tolerates them and still resolves an unmatched domain via the PSL
+        result = parsedmarc.utils.get_base_domain("sub.example.com")
+        self.assertEqual(result, "example.com")
+
+
+class TestIsMbox(unittest.TestCase):
+    """Tests for is_mbox utility"""
+
+    def testValidMbox(self):
+        """is_mbox returns True for valid mbox file"""
+        with NamedTemporaryFile(suffix=".mbox", delete=False, mode="w") as f:
+            f.write("From test@example.com Thu Jan  1 00:00:00 2024\n")
+            f.write("Subject: Test\n\nBody\n\n")
+            path = f.name
+        try:
+            self.assertTrue(parsedmarc.utils.is_mbox(path))
+        finally:
+            os.remove(path)
+
+    def testEmptyFileNotMbox(self):
+        """is_mbox returns False for empty file"""
+        with NamedTemporaryFile(suffix=".mbox", delete=False) as f:
+            path = f.name
+        try:
+            self.assertFalse(parsedmarc.utils.is_mbox(path))
+        finally:
+            os.remove(path)
+
+    def testNonExistentNotMbox(self):
+        """is_mbox returns False for non-existent file"""
+        self.assertFalse(parsedmarc.utils.is_mbox("/nonexistent/file.mbox"))
+
+
+if __name__ == "__main__":
+    unittest.main(verbosity=2)
diff --git a/tests/test_webhook.py b/tests/test_webhook.py
new file mode 100644
index 0000000..c15e9ce
--- /dev/null
+++ b/tests/test_webhook.py
@@ -0,0 +1,76 @@
+"""Tests for parsedmarc.webhook"""
+
+import unittest
+from unittest.mock import MagicMock
+
+import parsedmarc
+import parsedmarc.webhook
+
+
+class Test(unittest.TestCase):
+    """Kitchen-sink tests redistributed from the original
+    tests.py monolith. Future PRs should split these further
+    into purpose-specific TestCase subclasses as natural
+    groupings emerge."""
+
+    def testWebhookClientInit(self):
+        """WebhookClient initializes with correct attributes"""
+        from parsedmarc.webhook import WebhookClient
+
+        client = WebhookClient(
+            aggregate_url="http://agg.example.com",
+            failure_url="http://fail.example.com",
+            smtp_tls_url="http://tls.example.com",
+        )
+        self.assertEqual(client.aggregate_url, "http://agg.example.com")
+        self.assertEqual(client.failure_url, "http://fail.example.com")
+        self.assertEqual(client.smtp_tls_url, "http://tls.example.com")
+        self.assertEqual(client.timeout, 60)
+
+    def testWebhookClientSaveMethods(self):
+        """WebhookClient save methods call _send_to_webhook"""
+        from parsedmarc.webhook import WebhookClient
+
+        client = WebhookClient("http://a", "http://f", "http://t")
+        client.session = MagicMock()
+        client.save_aggregate_report_to_webhook('{"test": 1}')
+        client.session.post.assert_called_with(
+            "http://a", data='{"test": 1}', timeout=60
+        )
+        client.save_failure_report_to_webhook('{"fail": 1}')
+        client.session.post.assert_called_with(
+            "http://f", data='{"fail": 1}', timeout=60
+        )
+        client.save_smtp_tls_report_to_webhook('{"tls": 1}')
+        client.session.post.assert_called_with(
+            "http://t", data='{"tls": 1}', timeout=60
+        )
+
+    def testWebhookBackwardCompatAlias(self):
+        """WebhookClient forensic alias points to failure method"""
+        from parsedmarc.webhook import WebhookClient
+
+        self.assertIs(
+            WebhookClient.save_forensic_report_to_webhook,  # type: ignore[attr-defined]
+            WebhookClient.save_failure_report_to_webhook,
+        )
+
+
+class TestWebhookClient(unittest.TestCase):
+    """Tests for WebhookClient session teardown via close()"""
+
+    def testClose(self):
+        """WebhookClient.close() closes session"""
+        client = parsedmarc.webhook.WebhookClient(
+            aggregate_url="http://invalid.test/agg",
+            failure_url="http://invalid.test/fail",
+            smtp_tls_url="http://invalid.test/tls",
+        )
+        mock_close = MagicMock()
+        client.session.close = mock_close
+        client.close()
+        mock_close.assert_called_once()
+
+
+if __name__ == "__main__":
+    unittest.main(verbosity=2)