CineFiles image file workflows

Initial ideas based on a Deployment team discussion on July 16, 2013.

Current workflows

  • PFA digitization staff send TIFFs to Glen (via FTP to where?)
  • Glen sends the TIFFs to panamint (a leased Unix server), which serves as a dark archive.
  • Glen runs scripts that check file integrity (need to identify all the checks done here) and create derivatives.
  • Full-size and thumbnail JPEG derivatives are copied via SCP to the web farm server that hosts CineFiles content and the web application.  Files are saved into a structured directory system that the CineFiles web application can access.  The image file names are important here (document ID plus page number).  (Specifically, image files are sent to the dev instance of the web farm, where scripts grab them and put them in the right place.)
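
To make the naming and page-numbering checks concrete, here is a minimal Python sketch. The actual CineFiles file name format and the checks in Glen's scripts are not documented here, so the pattern `<document ID>.p<page number>.<extension>` and the names `parse_image_name` and `missing_pages` are purely hypothetical placeholders to be replaced once the real convention is confirmed.

```python
import re
from typing import Dict, Iterable, List, NamedTuple, Optional

# ASSUMPTION: hypothetical naming pattern, e.g. "12345.p3.jpg".
# The real CineFiles convention (document ID plus page number) may differ.
NAME_RE = re.compile(
    r"^(?P<doc_id>\d+)\.p(?P<page>\d+)\.(?P<ext>jpg|tif|gif)$",
    re.IGNORECASE,
)


class PageImage(NamedTuple):
    doc_id: str
    page: int
    ext: str


def parse_image_name(name: str) -> Optional[PageImage]:
    """Split a file name into document ID, page number, and extension.

    Returns None when the name does not follow the assumed pattern.
    """
    match = NAME_RE.match(name)
    if match is None:
        return None
    return PageImage(
        match.group("doc_id"),
        int(match.group("page")),
        match.group("ext").lower(),
    )


def missing_pages(names: Iterable[str]) -> Dict[str, List[int]]:
    """Report, per document ID, page numbers absent from 1..highest page seen."""
    pages: Dict[str, set] = {}
    for name in names:
        parsed = parse_image_name(name)
        if parsed is not None:
            pages.setdefault(parsed.doc_id, set()).add(parsed.page)

    gaps: Dict[str, List[int]] = {}
    for doc_id, seen in pages.items():
        missing = sorted(set(range(1, max(seen) + 1)) - seen)
        if missing:
            gaps[doc_id] = missing
    return gaps
```

A check like this could run before files are placed into the directory structure, so that misnamed files or gaps in page numbering are reported rather than silently published.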

Assumptions

We will be importing the existing set of original CineFiles images (mostly TIFFs but some GIFs) into the CineFiles-CSpace instance.  This will create derivatives that could be used on the CineFiles web site, but it is likely that we will not replace the current web application content.  New image content would go through the workflow proposed below.

Proposed workflow

  • An event handler/batch process (or nightly cron job?) will copy new and modified blob content (specify which derivatives: OriginalJpeg and Thumbnail?) to the web farm dev instance.
  • Logic either in the event handler or existing code in the web farm will need to perform the checks currently performed by Glen's scripts (file naming, page numbering, corrupt files, and so on).
  • There might be issues here with the display sizes of the derivatives created by Nuxeo via ImageMagick.  
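
As one way to picture the nightly cron variant, the following Python sketch selects JPEG derivatives modified since the last run, applies a cheap corruption check, and copies the survivors to a staging directory. Everything here is an assumption for illustration: the function names, the use of a local staging directory as a stand-in for the transfer to the web farm dev instance, and the marker-based check, which is only a heuristic (a truncated JPEG usually lacks the end-of-image marker, but some valid files carry trailing data and would be rejected).

```python
import shutil
from pathlib import Path
from typing import List, Tuple


def looks_like_jpeg(path: Path) -> bool:
    """Heuristic check: file starts with the JPEG SOI marker and ends with EOI."""
    data = path.read_bytes()
    return len(data) >= 4 and data[:2] == b"\xff\xd8" and data[-2:] == b"\xff\xd9"


def sync_new_derivatives(
    source_dir: Path, staging_dir: Path, last_run: float
) -> Tuple[List[str], List[str]]:
    """Copy JPEGs modified after last_run (epoch seconds) into staging_dir.

    Returns (copied, rejected) file names. The staging directory is a
    hypothetical stand-in for the transfer to the web farm dev instance.
    """
    copied: List[str] = []
    rejected: List[str] = []
    staging_dir.mkdir(parents=True, exist_ok=True)

    for path in sorted(source_dir.glob("*.jpg")):
        if path.stat().st_mtime <= last_run:
            continue  # unchanged since the previous run
        if not looks_like_jpeg(path):
            rejected.append(path.name)  # report instead of publishing
            continue
        shutil.copy2(path, staging_dir / path.name)
        copied.append(path.name)

    return copied, rejected
```

An event handler would replace the modification-time scan with a per-blob trigger, but the same validation step would apply either way.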

Alternative workflow

Alternatively, CineFiles could be refactored to look at images on CSpace via the PublicItem service and/or the image server application written by John Lowe.  Implications of this:

  • We would probably need to speed up image display from CollectionSpace (which we might want to do anyway).
  • This would require significant work in the CineFiles code base.