Association of Moving Image Archivists & Digital Library Federation Hack Day 2013/PlanningHangout 1101

From CURATEcamp

Notes from Hack Day check-in Hangout, 11/01/13, 2 pm. The main goal of the Hangout was to review existing projects in order to condense, combine, and scope down projects as necessary.

General notes re: project ideas:

  • Please remember to bring datasets on removable media, preferably rather than relying on downloading them from a web server.
  • Projects should ideally be ready to hit the ground running. People with ideas not already approved will be given a minute to propose them, followed by a quick review by the facilitators (though advance submissions are preferred).
  • Some people may be motivated by more discussion-focused issues of digital preservation. These could be addressed in a break-out “un-conference” session, to be determined.
  • Projects need not be confined to a single day’s work (though scope should still be relatively specific). The Developers Lounge/Hackers meeting room will be available during the conference to discuss further efforts or continue to work on projects with groups.
  • There is the opportunity to win prizes based on the effectiveness of the project. It’s a jury-based process and presentations will take place on Friday. Winners will be announced at the Saturday morning plenary. More information will be given at Hack Day.
  • Not all project proposals need be related to actual software development or coding. Examples include creating manuals or documentation of workflows/tools without existing guidelines, or an edit-a-thon (wiki shares and forms of web exchange).

General Notes for Participants

There is interest in creating a github share (possibly AMIA-wide) for script recipes; other ideas are welcome. Bring laptops! Paper-based materials will be provided (flip charts, markers, Post-its, etc.). Groups will ideally consist of 5 people. Interest in specific projects should be recorded on the sign-up sheet or by editing the wiki under “Who’s Interested In This Project”.

Existing projects - Re-scoping

Extracting EIA-608/line 21 closed caption info (Steven Villereal)

Abstract: Interested in the digitization of commercial VHS with captioning to a lossless compressed copy, and in how to walk that caption info out of the file (with CCExtractor) into a timed-text format (such as WebVTT). This could be relevant to the HTML5 world and, more generally, to time-based annotation of video.
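As a rough illustration of the target format, the sketch below writes already-extracted captions out as WebVTT. It assumes the captions have been pulled from the file by some other tool and are available as (start, end, text) tuples with times in seconds; the function names are hypothetical and this is not how CCExtractor itself works.

```python
def format_timestamp(seconds: float) -> str:
    """Format a time in seconds as a WebVTT timestamp (HH:MM:SS.mmm)."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}.{ms:03d}"


def captions_to_webvtt(captions) -> str:
    """Build a WebVTT document from (start_seconds, end_seconds, text) tuples."""
    lines = ["WEBVTT", ""]  # required header, then a blank line
    for start, end, text in captions:
        lines.append(f"{format_timestamp(start)} --> {format_timestamp(end)}")
        lines.append(text)
        lines.append("")  # a blank line ends each cue
    return "\n".join(lines)


if __name__ == "__main__":
    # Made-up sample captions, for illustration only.
    sample = [(1.0, 3.5, "Hello."), (4.0, 6.25, "Line 21 captions.")]
    print(captions_to_webvtt(sample))
```

The resulting text can be saved with a .vtt extension and referenced from an HTML5 track element.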

Possible relation to...

Interactive Video/Transcript Streaming (Chris McNeave)

Abstract: Involves the use of the Interactive Video/Transcript viewer package, an open-source program that also has a component for time-coded transcription. The aim is to present and provide access more stably, given issues with browser capabilities and operating systems. Steven will do more research to see whether some of his work with EIA-608 is relevant to Chris’ project; some of the other snags with operating systems and browsers, the way the data is stored, and the target format could also be explored. There may also be interest in exploring time-stamped text examples from the W3C world, along with issues of web interfaces.

Possible relation to...

Moving Image Research Collections Digital Video Repository (Ashley Brewer)

Abstract: Interested in timecode-based tagging for user-generated metadata; the issue is user-end input rather than how video material is prepared and streamed. Amara social tagging and open subtitling were proposed as potential tools, using time-coded metadata that’s folded into the file. The group could also look at the scripts as a means of documenting ffmpeg, creating recipes for transcode scripts to be shared on github.


Integration of MediaInfo-generated metadata into a forensic imaging workflow (Don Mennerich)

Abstract: Uses Sleuthkit for interacting with file systems. When disks and hard drives are acquired, FIWalk can put this information in an XML format, and it has a gateway for calling external programs when it hits an audiovisual file type. The desire is to send such files through MediaInfo and wrap the resulting metadata in XML for future reporting and richer metadata capture. The problem is that the program is written in C++ and its interaction with the MediaInfo library is difficult, so this functionality is currently limited in FIWalk. This could be of interest to people with knowledge of C++ who are looking to apply this to other tools.
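The general idea can be sketched outside of FIWalk. The example below is a minimal Python sketch, not the C++ plugin mechanism described above: it assumes the `mediainfo` command-line tool is installed and on the PATH, and the extension list and function names are hypothetical.

```python
import subprocess
from pathlib import Path

# Hypothetical list of extensions treated as audiovisual; adjust locally.
AV_EXTENSIONS = {".mov", ".mp4", ".avi", ".mkv", ".mxf", ".wav", ".mp3"}


def is_audiovisual(path: str) -> bool:
    """Decide by extension alone whether a file should go to MediaInfo."""
    return Path(path).suffix.lower() in AV_EXTENSIONS


def mediainfo_command(path: str) -> list:
    """Command line asking the MediaInfo CLI for an XML report."""
    return ["mediainfo", "--Output=XML", path]


def mediainfo_xml(path: str):
    """Return MediaInfo's XML report as text, or None for non-AV files."""
    if not is_audiovisual(path):
        return None
    result = subprocess.run(
        mediainfo_command(path), capture_output=True, text=True, check=True
    )
    return result.stdout
```

A forensic workflow could call `mediainfo_xml` for each file it encounters and embed any non-empty result in its own report.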

Possible relation to...

Reconciling filenames with embedded technical metadata/name parameters (Kathryn Gronsbell)

Abstract: Interested in creating a tool that adds another level of control at the file level and extracts technical metadata to ensure accuracy, e.g. for access copies in a directory, checking whether a file with a .mov extension is actually QuickTime. There is a possible relation to regular expressions, though the concern is more with the output function. Don sees convergence with the mapping done by his tool, and the relationship between file type and extension is relevant to his project as well.
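One possible shape for such a check is sketched below. It assumes the container format has already been reported by a tool such as MediaInfo; the naming pattern, the format labels, and the function name are all hypothetical and would need to match local conventions.

```python
import re
from pathlib import Path

# Hypothetical naming rule: collection code, 4-digit item number, role.
NAME_PATTERN = re.compile(
    r"^(?P<collection>[A-Za-z]+)_(?P<item>\d{4})_(?P<role>access|master)\.\w+$"
)

# Hypothetical mapping from extension to acceptable reported format labels.
EXPECTED_FORMATS = {
    ".mov": {"QuickTime", "MPEG-4"},
    ".wav": {"Wave"},
    ".mkv": {"Matroska"},
}


def check_file(filename: str, reported_format: str) -> list:
    """Return a list of problems found for one file (empty if none)."""
    problems = []
    if NAME_PATTERN.match(filename) is None:
        problems.append("filename does not follow the naming pattern")
    ext = Path(filename).suffix.lower()
    expected = EXPECTED_FORMATS.get(ext)
    if expected is None:
        problems.append(f"no expected format registered for {ext!r}")
    elif reported_format not in expected:
        problems.append(f"extension {ext} but reported format is {reported_format}")
    return problems
```

Returning a list of problems rather than a yes/no answer keeps the output usable as a report across a whole directory.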


RDFing PBCore (Kara Van Malssen/Karen Cariani)

Abstract: With the emergence of Fedora 4 there is a desire to move towards an RDF schema (no RDF expression exists for PBCore, though one has been mapped for EBUCore). The tool used for EBUCore could be examined and reverse-engineered for this purpose. A link to this tool is to be added to the wiki.

Possible relation to...

Metadata schema development and mapping tool (Yves Niederhauser)

Abstract: Interested in AV metadata standards crosswalks, mapping XML to EBUCore, perhaps through MINT (an online tool with a GUI for custom XSDs and drag-and-drop style sheets for mapping). This could be of interest to organizations that ingest and index these items but have no AV-specific metadata. This could also be a good test instance for the DPLA metadata map generator, and could relate to RDF mapping. Could be of interest to: Hack Day participants who see gaps in the workflows of their organizations’ mapping projects, or who wish to evaluate overall mapping systems.
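At its simplest, a crosswalk is a lookup table from source fields to target fields. The sketch below shows that idea only; it is not how MINT works, the target field names are placeholders rather than names taken from the EBUCore specification, and the record is made up.

```python
# Illustrative crosswalk: source element names on the left, placeholder
# target field names on the right (not verified against any schema).
CROSSWALK = {
    "pbcoreTitle": "title",
    "pbcoreDescription": "description",
    "pbcoreIdentifier": "identifier",
}


def apply_crosswalk(record: dict, crosswalk: dict):
    """Map a flat record to target fields; report fields with no mapping."""
    mapped = {}
    unmapped = []
    for field, value in record.items():
        target = crosswalk.get(field)
        if target is None:
            unmapped.append(field)  # a gap in the crosswalk
        else:
            mapped[target] = value
    return mapped, unmapped


if __name__ == "__main__":
    record = {"pbcoreTitle": "Sample tape", "localNote": "water damage"}
    print(apply_crosswalk(record, CROSSWALK))
```

Listing the unmapped fields is one way to surface the workflow gaps mentioned above.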


Digitization workflow development tool (Yves Niederhauser)

Abstract: The project is likely too large in scope, but it might be helpful to pair it with project 7.

Possible relation to...

Format/codec evaluation/selection tool (Yves Niederhauser)

Abstract: Focuses on decision-making (possible creation of a decision “tree” or workflow diagram in which initial questions or concerns lead to further questions). Migration of media may be too large in scope to explore as a deliverable, but a decision tree would be a good deliverable.


Creating a sample METS for Digitization Project of Analog Audiovisual Collections (JungYun Oh)

Abstract: In order to work with a custom XSD for process metadata in video, the University of Michigan made their own schema, but there may be a more standardized example, and the group could work on a METS profile. Lauren (BAVC) is interested in this in relation to the Dance Heritage project, with CSV transfer log data (script recipe to be inserted on the wiki).

Possible relation to...

Produce easy-to-follow documentation for the installation and use of FFMPEG transcoding software (Nash Bly)

Could be of interest to: archivists and non-coders who are looking to develop an output “wishlist” for ffmpeg. Creating a list of scripts and common profiles would be helpful.
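A list of common profiles could be kept as named recipes that expand into full ffmpeg command lines. The sketch below is one way to do that; the recipe names, the choice of options, and the function names are illustrative assumptions, not recommendations from the Hangout, and nothing here actually runs ffmpeg.

```python
import shlex

# Hypothetical recipe names mapped to ffmpeg output options.
RECIPES = {
    # Lossless FFV1 version 3 video, audio copied unchanged.
    "preservation_ffv1": ["-c:v", "ffv1", "-level", "3", "-c:a", "copy"],
    # H.264 video with AAC audio for access copies.
    "access_h264": ["-c:v", "libx264", "-pix_fmt", "yuv420p", "-c:a", "aac"],
}


def build_command(recipe: str, source: str, target: str) -> list:
    """Expand a named recipe into an ffmpeg argument list."""
    return ["ffmpeg", "-i", source, *RECIPES[recipe], target]


def as_shell(command: list) -> str:
    """Render the argument list as a copy-pasteable shell command."""
    return " ".join(shlex.quote(part) for part in command)


if __name__ == "__main__":
    print(as_shell(build_command("access_h264", "in.mov", "out.mp4")))
```

Keeping recipes as data makes them easy to document on the wiki and to share on github alongside an explanation of each option.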