VA API Landscape Analysis and Roadmapping Project
In support of this project, we spidered the va.gov domain to discover existing datasets that should be turned into APIs. Along the way, we learned a lot about how the VA manages its data and maintains its web presence.
This is a snapshot of the data that was targeted, processed, and identified, including summary counts, visual tag clouds, listings, and the underlying raw data.
- All (4,058,278 targeted w/ 1,216,346 processed ) Tag Cloud Tag List
- Program Subdomains (133) Tag Cloud Domain List
- City Subdomains (120) Tag Cloud Domain List
- State Subdomains (22) Tag Cloud Domain List
- Paths Tag Cloud Tag List URLs
- CSV (534) Tag Cloud Tag List URLs
- JSON (467) Tag Cloud Tag List URLs
- XML (3,099) Tag Cloud Tag List URLs
- XLS/XLSX (6,077) Tag Cloud Tag List URLs
- Table (8,393) Tag Cloud Tag List URLs
- Form (9,439) Tag Cloud Tag List URLs
- Data.gov (519) Tag Cloud Tag List URLs
- Our Final Report
The data behind all of this is available as JSON files in the Github repository for the project, and will be updated daily as we continue spidering the site.