Skip to content

Scraper and parser of Cleveland City Council's records and the produced text.

Notifications You must be signed in to change notification settings

opencleveland/drocer

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

77 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

drocer

Cleveland City Council's legislative records are available as PDFs from 1996-present.

As PDFs, there's many things that you cannot to do with them: search in multiple files at one time; easily extract data from them for further analysis; browse legislation by a particular topic; and more.

In short, PDFs make it very difficult for programmers, journalists, researchers, and residents to use the data inside them.

Our goal is to remove those barriers.

What's here:

The year folders (e.g. 2017) consist of plain text files extracted from PDFs of The City Record, Cleveland City Council's legislative notes.

The files are not recommended for reading. Small errors (misspellings and improper spacing, especially for lists) were created during the text extraction. Tables and graphics in the City Record are also missing from these plain text files.

Check the original PDFs or download them by year in bulk to verify if any information is missing.

Learn more about our goals, project roadmap, and how you can help at:

http://www.opencleveland.org/projects/drocer/

https://github.com/opencleveland/drocer/wiki

Additional documentation and plans is scattered at: city-record-contents-formatting.md - boilerplate text that is used in the city record and what it means.

About

Scraper and parser of Cleveland City Council's records and the produced text.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published