CURATEcamp Processing 2012

From CURATEcamp
Jump to: navigation, search

Link to CURATEcamp Processing 2012 Schedule

There will be a one-day CURATEcamp Following this years DigitalPreservation 2012 conference in Arlington, VA. Registration is open now! & Space is limited. We are focusing this camp on the idea of processing, bringing together the computational sense of the word with the archival sense of it. We are particularly excited about bringing together archivists and curators with software developers and engineers to do some creative thinking and tinkering. You can read up on the topic below. The Camp is being facilitated by Trevor Owens and Leslie Johnston from the Library of Congress Meg Phillips, Electronic Records Lifecycle Coordinator at the National Archives and Records Administration and Mark Matienzo, Digital Archivist at Yale University.

  • WHEN: Thursday July 26, 2012 (9am - 3pm)
  • WHERE: Sheraton Pentagon City, 900 South Orme Street, Arlington, VA 22204
  • COST: Free, there is no cost to register for the meeting
  • LOGISTICS: The hotel will offer a continuous shuttle to transport guests from Pentagon City Metro between 8-9am and 2:30-3:30pm on July 26. Parking is $10 per day with discount sticker.
  • DISCUSSION: #curatecamp on Twitter and #curatecamp on

Space is limited to 100 registrants, so reserve your spot while you can

Participants have already started sharing ideas on the camp announcement post. Please take a minute to share ideas you have for the session on the announcement blog post.

CURATEcamp Processing: Processing Data/Processing Collections

Processing means different things to an archivist and a software developer. To the former, processing is about taking custody of collections, preserving context, and providing arrangement, description, and accessibility. Processing, in its analog archival sense, also includes a lot of preservation, (stabilization, preliminary conservation assessment, and the dreaded “re-housing”). To the latter, processing is about computer processing and has to do with how one automates a range of tasks through computation. When a cultural heritage organization’s work is organized around processing digital objects, these two notions of processing intermingle. This CurateCamp unconference is intended to put these two notions of processing together in whatever ways can be imagined by the curators, archivists, librarians, scholars, software developers, computer engineers, and others that attend.

Potential topics and considerations could include:

  • Automated inventorying and file characterization
  • Computational determination of hierarchical arrangement
  • Format validation & migrations
  • Automated metadata extraction
  • Potential roles for entity extraction in subject cataloging
  • Dynamically generated description
  • Malware scanning
  • Pattern & fuzzy searching for PII, SSNs, etc
  • Automated access restrictions
  • Generating visualizations and using them as access tools
  • Human computation’s potential role in cultural heritage collections
  • Machine learning and digital collections
  • Using name authority linked data
  • Processes for geo-refferencing
  • Potential uses of facial recognition tools for identifying individuals in collection images