inventory.data.gov

Aaron D Borden edited this page Dec 4, 2019 · 24 revisions

inventory.data.gov, a.k.a. Inventory, is used by federal agencies to manage metadata for their datasets. Inventory generates the agency's data.json, which must be hosted on the agency's website (e.g. agency.gov/data.json). Inventory is a CKAN instance and can host datasets in addition to metadata.

Environments

Instance     URL
Production   inventory.data.gov
Staging      inventory-datagov.dev-ocsit.bsp.gsa.gov
CI           inventory.ci.datagov.us

Dependencies

Sub-components:

  • ckan
  • datapusher

Services:

  • apache2
  • rds
  • redis
  • s3
  • solr

Common tasks

Importing from data.json

ckanpyimport is used to onboard new agencies to inventory.data.gov. The tool imports datasets from a data.json file. The import can take a while, depending on how many packages need to be imported, so it is best to run it from the jumpbox under nohup, which keeps it running if the SSH session disconnects. Test against staging before running against production.
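Since the import time depends on the number of packages, it can help to inspect the data.json file before starting. The following is a minimal sketch, not part of ckanpyimport: it assumes the file follows the Project Open Data layout with a top-level "dataset" array, and the function names and the sample catalog are hypothetical, chosen for illustration only.

```python
import json
from collections import Counter


def load_catalog(path):
    """Read a data.json file from disk and return the parsed catalog."""
    with open(path, encoding="utf-8") as f:
        return json.load(f)


def summarize_catalog(catalog):
    """Summarize a parsed data.json catalog before importing it.

    Assumes a top-level "dataset" array (Project Open Data layout).
    """
    datasets = catalog.get("dataset", [])
    # Tally datasets per publisher name; tolerate missing publisher blocks.
    publishers = Counter(
        (d.get("publisher") or {}).get("name", "(unknown)") for d in datasets
    )
    # Collect titles of datasets that lack an identifier.
    missing_identifier = [
        d.get("title", "(untitled)") for d in datasets if not d.get("identifier")
    ]
    return {
        "count": len(datasets),
        "publishers": dict(publishers),
        "missing_identifier": missing_identifier,
    }


# Hypothetical two-dataset catalog, for illustration only.
SAMPLE_CATALOG = {
    "conformsTo": "https://project-open-data.cio.gov/v1.1/schema",
    "dataset": [
        {
            "title": "Dataset A",
            "identifier": "agency-001",
            "publisher": {"name": "Example Agency"},
        },
        {
            "title": "Dataset B",
            "publisher": {"name": "Example Agency"},
        },
    ],
}

summary = summarize_catalog(SAMPLE_CATALOG)
print(json.dumps(summary, indent=2))
```

For a real file, `summarize_catalog(load_catalog("data.json"))` gives the same summary. A large count suggests a long-running import, and any entries under `missing_identifier` may be worth resolving with the agency before importing.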
