Classify reverse DNS map: next 5000 unmapped MMDB ASN domains (#758)

Auto-classification rate jumped back to 50% (2502/4999) from 36.5% in
batch 8 — this slice happens to contain a higher proportion of small ISPs
with conventional homepages, lifting the regex hit rate.

- 2,502 added to base_reverse_dns_map.csv (ISP 2,065, Web Host 133,
  Education 96, Finance 67, MSP 60, Government 58, Healthcare 23).
- 2,496 added to known_unknown_base_reverse_dns.txt.

ASN-domain coverage of the bundled IPinfo Lite MMDB after this batch:
  - by domain count:  19,154 / 63,993  (29.93%, up from 26.02%)
  - by IPv4 weight:   98.18%           (up from 98.09%)

Same classifier as batches 5-8 (no new code).

Co-authored-by: Sean Whalen <seanthegeek@users.noreply.github.com>
Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
This commit is contained in:
Sean Whalen
2026-05-07 13:46:28 -04:00
committed by GitHub
parent c523d0da9c
commit 80a132801d
2 changed files with 4998 additions and 0 deletions
File diff suppressed because it is too large Load Diff
File diff suppressed because it is too large Load Diff