Tools

 

Included Tools

The latest version of FITS is configured to use these tools for identifying, validating, and extracting technical metadata:

  • ADL Tool
    • Maintenance organization:Harvard Library
    • Formats supported: Audio Decision List files
  • Apache Tika
  • Droid
    • Capabilities: Identifies file formats
    • Maintenance organization: UK National Archives
    • Details: DROID is written in Java. The FITS tool wrapper uses the provided API. The output is converted into a simple XML document and then converted to FITS XML using xml/droid/droid_to_fits.xslt. The DROID configuration file and signature file are located in the tools/droid directory.
    • Formats supported: many (> 1,000) - Listed in the DROID signature file
  • ExifTool
    • Capabilities: Identifies and extracts technical metadata.
    • Maintenance: Phil Harvey
    • Details: Exiftool is written in Perl. A windows executable is also provided. The Exiftool tool wrapper detects the operating system type and calls the appropriate version of the tool. The tab-delimited output is captured, converted to a simple XML structure, and then converted to FITS XML using xslt. xml/exiftool/exiftool_xslt_map.xml is used to determine which XSLT to apply for the given identified format.
    • Formats Supported: jpg,tiff,jp2,gif,bmp,png,psd,dng,wav,mp3,mp4,m4a,aiff,rm,ogg,flac,xml,html,pdf,doc
  • ffident
    • Capabilities: Identifies file formats.
    • Maintenance: no longer maintained - the old website is archived here: http://web.archive.org/web/20061106114156/http://schmidt.devlib.org/ffident/index.html
    • Details: FFIdent is written in Java. The FITS tool wrapper uses the provided API. Output is converted into a simple XML document and then converted to FITS XML using xml/ffident/ffident_to_fits.xslt.
    • Formats Supported: Listed in the configuration file tools/ffident/formats.txt
  • File utility (windows port)
    • Capabilities: Identifies files.
    • Details: File Utility is usually bundled with Linux, UNIX and OS X. The GnuWin32? port is provided for use on Windows. Due to variations in versions this may cause different output when run on different platforms. File Utility is called in its default mode (no arguments), and also with -i to determine the MIME type. The output is converted into a simple XML document and then converted to FITS XML using xml/fileutility/fileutility_to_fits.xslt
    • Formats Supported: many (> 1,000)
  • Jhove
    • Capabilities: Identifies, extracts technical metadata, and validates files.
    • Maintenance: Open Preservation Foundation. See: http://jhove.openpreservation.org/
    • Details: JHOVE is written in Java. The FITS tool wrapper uses the provided API. The JHOVE XML output is converted to FITS XML using XSLT. xml/jhove/jhove_xslt_map.xml is used to determine which XSLT to apply for the given identified format.
    • Formats Supported: jpg,tiff,jp2,gif,wave,aiff,xml,html,ascii,utf-8,pdf.
    • Notes:
      • For JP2 files the JHOVE output element Transformation indicates whether the compression is lossy or lossless. The transformation values are described in Table A-20 of the JPEG2000 part 1 specification. A value of 0 maps to the 9-7 irreversible (lossy) filter. A value of 1 maps to 5-3 reversible (lossless) filter. This JHOVE element is used by FITS when it outputs the compressionScheme in the image metadata, writing it as JPEG 2000 Lossy or JPEG 2000 Lossless.
      • JHOVE does not validate the codestream but it checks the file structure.
  • MediaInfo
    • Capabilities: Identifies and extracts technical metadata for video files. 
    • Maintenance: MediaArea.net
    • Details: The MediaInfo API is written in C++ and is called via Java by using the JNA library. For more details on JNA, see: https://github.com/java-native-access/jna
    • Formats Supported: Although MediaInfo supports many video formats, FITS will only support the following video formats and wrappers out of the box:
      avi, mov, mpg, mpeg, mkv, mp4, mxf, ogv, mj2, divx, dv, m4v, m2v, ism.

  • National Library of New Zealand (NLNZ) Metadata Extractor Tool
    • Capabilities: Identifies and extracts technical metadata.
    • Maintenance organization: National Library of New Zealand
    • Details: The FITS NLNZ tool wrapper uses the provided Java API. The NLNZ native XML output is converted to FITS XML using XSLT. xml/nlnz/fits/nlnz_xslt_map.xml is used to determine which XSLT to apply to the given identified format.
    • Formats Supported: jpg,tiff,gif,bmp,wav,mp3,xml,html,pdf,doc,wordperfect,msworks,odt
    • Source code on GitHub here.
  • OIS Audio Information
    • Maintenance organization:Harvard Library
    • Formats supported: audio
  • OIS File Information
    • Capabilities: Extracts technical metadata.
    • Maintenance organization:Harvard Library
    • Details: FileInfo creates FITS XML natively. It determines basic file information like file name, size, file system last modified date, and md5 checksums. It uses the fast md5 jar from http://www.twmacinta.com/myjava/fast_md5.php. Written by Harvard Library.
    • Formats Supported: Any
  • OIS XML Information
    • Capabilities: Identifies and extracts technical metadata.
    • Maintenance organization:Harvard Library
    • Details: XmlMetadata creates FITS XML natively. Its sole purpose is to identify XML and parse out the default namespace and schema location. This is used for FITS text metadata. Written by Harvard Library.
    • Formats Supported: XML

In addition FITS includes the following open source libraries (among others):

Potential Tools to Add to FITS

Other tools that might be a good fit for FITS: