Takes a free-text company name — from a CRM, a sponsor list, a regulatory filing — and returns its canonical domain. Trivial-looking, surprisingly hard at scale.
The first step on any dataset where the source gives a name but not a URL: CRM exports, partner lists, sponsor lists, conference attendee lists, regulatory filings. Without a resolved domain, every downstream agent has nothing to attach its results to.
The domain is the join key for everything downstream. A high resolution rate means downstream tools have a key to operate on; a low rate is itself a diagnostic: it tells you the source data is thin and the dataset may not be worth processing.
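
As a rough illustration of the resolution rate as a diagnostic, the sketch below resolves a batch of names and reports the fraction that came back with a domain. Everything named here is an assumption for illustration only: `resolution_rate`, `toy_resolve`, the `KNOWN` lookup table, and the `.example` domains are hypothetical stand-ins, not the actual resolver or its interface.

```python
from typing import Callable, Iterable, List, Optional, Tuple


def resolution_rate(
    names: Iterable[str],
    resolve: Callable[[str], Optional[str]],
) -> Tuple[List[Tuple[str, Optional[str]]], float]:
    """Resolve each name and return (rows, rate).

    rows pairs every input name with its domain (or None if unresolved);
    rate is the fraction of names that resolved to a domain.
    """
    rows = [(name, resolve(name)) for name in names]
    hits = sum(1 for _, domain in rows if domain)
    rate = hits / len(rows) if rows else 0.0
    return rows, rate


# Hypothetical stand-in for the real resolver: a tiny in-memory lookup.
KNOWN = {"acme corp": "acme.example", "globex": "globex.example"}


def toy_resolve(name: str) -> Optional[str]:
    # Lowercase and trim so trivial variants of a name hit the same entry.
    return KNOWN.get(name.strip().lower())


rows, rate = resolution_rate(
    ["Acme Corp", "Globex", "Unknown Holdings", "n/a"], toy_resolve
)
print(f"resolved {rate:.0%} of {len(rows)} names")  # resolved 50% of 4 names
```

Rows with a domain can be joined against downstream data on that domain; rows with `None` have no key to join on, which is why the rate is worth checking before any further processing.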