Apache Tika™ 2.1.0 is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

Copyright notice - License terms