File identification software
From Just Solve the File Format Problem
(Difference between revisions)
(Added categories) |
Dan Tobias (Talk | contribs) |
||
Line 11: | Line 11: | ||
* [[FIDO]] (cross-platform, open source) [http://www.openplanetsfoundation.org/software/fido website]: Format Identification for Digital Objects, written in [[Python]]. | * [[FIDO]] (cross-platform, open source) [http://www.openplanetsfoundation.org/software/fido website]: Format Identification for Digital Objects, written in [[Python]]. | ||
* [[FIDOO]] (web-based online file identification): [http://www.techmaurice.com/fidoo/ website] | * [[FIDOO]] (web-based online file identification): [http://www.techmaurice.com/fidoo/ website] | ||
− | * [[File command]] (various implementations): a standard Unix command, found on almost all Unix and Unix-like (i.e., Linux) systems. See the [http://manpages.debian.net/cgi-bin/man.cgi?query=file&apropos=0&sektion=0&manpath=Debian+6.0+squeeze&format=html&locale=en Debian man page] for an overview. | + | * [[File command]] (various implementations): a standard Unix command, found on almost all Unix and Unix-like (i.e., Linux) systems. See the [http://manpages.debian.net/cgi-bin/man.cgi?query=file&apropos=0&sektion=0&manpath=Debian+6.0+squeeze&format=html&locale=en Debian man page] for an overview, and [http://openpreservation.org/blog/2012/08/09/magic-editing-and-creation-primer/ this] guide to creating "magic" entries for it. |
* [[File Information Tool Set]]: software from the Harvard University library to identify file formats and extract metadata | * [[File Information Tool Set]]: software from the Harvard University library to identify file formats and extract metadata | ||
*[[FI Tools]] (Windows, commercial, [http://www.forensicinnovations.com/fitools.html website]) | *[[FI Tools]] (Windows, commercial, [http://www.forensicinnovations.com/fitools.html website]) |
Revision as of 18:50, 27 March 2016
Software | > | File identification software |
Software that automates the process of Identifying Files.
- Apache Tika (cross-platform, open source, website): "The Apache Tika™ toolkit detects and extracts metadata and structured text content from various documents using existing parser libraries." Written in Java.
- DROID (cross-platform, open source, website): "DROID is a software tool developed by The National Archives [of the United Kingdom] to perform automated batch identification of file formats." Requires Java 7 or 8 (Version 6.1.5).
- FIDO (cross-platform, open source) website: Format Identification for Digital Objects, written in Python.
- FIDOO (web-based online file identification): website
- File command (various implementations): a standard Unix command, found on almost all Unix and Unix-like (i.e., Linux) systems. See the Debian man page for an overview, and this guide to creating "magic" entries for it.
- File Information Tool Set: software from the Harvard University library to identify file formats and extract metadata
- FI Tools (Windows, commercial, website)
- G-Spot (Windows, freeware, website): Identifies audio and video codecs need to play a media file.
- JHOVE (tool to classify/identify/validate file formats)
- MediaInfo (cross-platform, open source, website): "MediaInfo is a convenient unified display of the most relevant technical and tag data for video and audio files."
- PHP PRONOM drip: Recognize file formats using PRONOM registry (open source, website)
- Siegfried (signature-based file identification tool) website blog post
- TrID (Windows/Linux, free for non-commercial use, website): identifies files using a database of filetype signatures. Also has an online version.