Search

Search using this query type:



Search only these record types:

Item
Exhibit
Exhibit Page
Simple Page

Advanced Search (Items only)

Home > CCP Corpus

CCP Corpus

What is the CCP Corpus?

The CCP Corpus provides easy access to our growing collection of transcribed minutes and proceedings of the Colored Conventions.

The Colored Conventions Project seeks to bring the buried history of nineteenth-century Black organizing to digital life. Part of this work is enabling researchers to look for patterns across this broad, rich history in the language of the minutes themselves. These transcriptions are made possible by the dedicated volunteers of the CCP Transcribe Minutes initiative.

The CCP Corpus is designed to be easy to use in most large text-analysis applications, from Voyant Tools to topic modeling, the Natural Language Toolkit, or natural language processing.

How do I use it?

Click on the link below to start downloading the zipped folder with all of the CCP Corpus.

2016-11-ccp-corpus-0.2.zip (updated Nov. 5, 2016)

2015-11-ccp-corpus-0.1.zip

As the CCP remains committed to the large-scale recovery of the convention minutes, please be aware that this collection is as yet incomplete. All uses of these materials published online or in print should indicate the provisional nature of the CCP Corpus. The CCP Corpus will grow significantly with the progress of Transcribe Minutes. Each updated version will be titled with the year and month last updated (i.e. 2015-11-CCP-Corpus.zip).

The downloaded zip file includes:

  • A folder with all of the minutes in plain-text format.
  • A table of contents in CSV form with relevant event and bibliographic data for each of the minutes.
  • A text file named "Read Me" with notes about updates, permissions and citation guidelines.

The table of contents describes each of the texts in the collection, with a unique file id, public url, and event start date. Additional metadata is available upon request.

The Read Me file provides details about the collection's updates, reproduction permissions and guidelines on how to cite the CCP Corpus.

Feedback

We are eager to promote innovative uses of the convention minutes. If you are using these materials, or have any feedback to improve the CCP Corpus, please contact us at coloredconventions {at} udel.edu.

Copyright Statement

The Colored Conventions Project Corpus is being released under a Creative Commons Attribution-NonCommercial 4.0 International License. Please credit the Colored Conventions Project for providing access to these materials.

Word Trends using the CCP Corpus

The tool below was created using Voyant Tools and all of the documents added to the CCP Corpus by November 2016.

Use the search box on the bottom of the window below to see trends of words used in the minutes over time. To search for multiple words, put a comma between the search terms. The tool can be used to search for the names as they appear in the minutes as well.

If the tool has broken, please let us know. If you find anything of interest, please do share it with us by email or on Twitter

You can also use a full screen version

 

Word Trends using the CCP Corpus

The tool below was created using Voyant Tools and all of the documents added to the CCP Corpus by November 2015.

Use the search box on the bottom of the window below to see trends of words used in the minutes over time. To search for multiple words, put a comma between the search terms. The tool can be used to search for the names as they appear in the minutes as well.

If the tool has broken, please let us know. If you find anything of interest, please do share it with us by email or on Twitter

You can also use a full screen version