Classify reverse DNS map: next 10000 unmapped MMDB ASN domains (#761)

Batch 12. Auto-rate dropped to 23% (2330/9998) — significantly lower than
batch 11. The deeper into the long tail, the more candidates fall into
non-classifier-recognized industries (retail, manufacturing, hospitality,
local services) where the ISP/Web Host/MSP regex doesn't fire even though
the page is fetchable.

- 2,330 added to base_reverse_dns_map.csv (ISP 991, Education 295, Finance
  290, Government 265, Web Host 229, Healthcare 134, MSP 126).
- 7,667 added to known_unknown_base_reverse_dns.txt.

ASN-domain coverage of the bundled IPinfo Lite MMDB after this batch:
  - by domain count:  27,824 / 63,993  (43.48%, up from 40.27%)
  - by IPv4 weight:   98.36%

Same classifier as prior batches (no new code).

Co-authored-by: Sean Whalen <seanthegeek@users.noreply.github.com>
Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
This commit is contained in:
Sean Whalen
2026-05-07 15:54:25 -04:00
committed by GitHub
parent e6716c9e80
commit fa03b8f2c2
2 changed files with 9997 additions and 0 deletions
File diff suppressed because it is too large Load Diff
File diff suppressed because it is too large Load Diff