Methodology

Data comes primarily from state-published open-data CSV files, supplemented by scraped data that is updated dynamically. To meet obfuscation requirements (e.g., SPEC 10.7), some confidential personal information is stripped or merged, which protects privacy while keeping macro-level aggregates accurate.

A fuzzy-matching algorithm built on the PostgreSQL `pg_trgm` extension links differing payee names (for example, those with typos or alternate acronyms) to a centralized Vendor Master list, which records each vendor's HUB status and standardized business location.
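
To illustrate the idea behind trigram matching, here is a minimal Python sketch that approximates how `pg_trgm` scores two strings: each word is lowercased, padded with two leading spaces and one trailing space, split into three-character grams, and the score is the number of shared trigrams divided by the number of distinct trigrams overall. This is not the project's actual matching code. The function names, the sample vendor names, and the ASCII-only tokenization are assumptions made for illustration; the 0.3 cutoff mirrors the default `pg_trgm.similarity_threshold`, and the project may use a different value.

```python
import re


def trigrams(text: str) -> set[str]:
    """Approximate pg_trgm trigram extraction (ASCII-only simplification)."""
    grams: set[str] = set()
    # Lowercase, then treat every run of letters/digits as one word.
    for word in re.findall(r"[a-z0-9]+", text.lower()):
        # pg_trgm pads each word with two leading spaces and one trailing space.
        padded = f"  {word} "
        grams.update(padded[i:i + 3] for i in range(len(padded) - 2))
    return grams


def similarity(a: str, b: str) -> float:
    """Shared trigrams divided by all distinct trigrams (0.0 to 1.0)."""
    ta, tb = trigrams(a), trigrams(b)
    if not ta or not tb:
        return 0.0
    return len(ta & tb) / len(ta | tb)


def best_vendor_match(payee: str, vendors: list[str], threshold: float = 0.3):
    """Return (vendor, score) for the closest vendor, or (None, score) if below threshold.

    `vendors` stands in for the Vendor Master list; the threshold is an assumption.
    """
    scored = [(similarity(payee, v), v) for v in vendors]
    score, vendor = max(scored, default=(0.0, None))
    return (vendor, score) if score >= threshold else (None, score)


# Hypothetical vendor names, not taken from the real Vendor Master list.
vendor_master = ["Acme Office Supply", "Lone Star Paving"]
match, score = best_vendor_match("ACME OFFICE SUPLY", vendor_master)
print(match, round(score, 2))
```

Inside PostgreSQL, the same comparison would normally be done with the extension's `similarity()` function or `%` operator, backed by a trigram index, rather than in application code.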