TERESAH teresah.dasish.eu
HistoryOnline www.history.ac.uk
Tika

Available Data

Application Category
4 Analysis
Keyword
Active development, Advanced, Offline, Single purpose
Type
Tool
Url
https://tika.apache.org/

Available Data Formats

RDF/JsonLD RDF/Turtle RDF/XML

Available Data

Description
Tika is a Java-based text extraction tool which is primarily aimed at handling data in unknown formats; it extracts metadata and structured content from unknown texts, and can also detect the language of the text. A feature of the tool is that it is designed to work with third-party parsers, so that developers can integrate Tika with parsers they are already familiar with.

Available Data Formats

RDF/JsonLD RDF/Turtle RDF/XML

Similar Tools