Darmstadt Knowledge Processing Software Repository (DKPro Core)

Abstract:
DKPro Core is a collection of software components for Natural Language Processing (NLP) based on Apache UIMA. Many powerful NLP components are already freely available, and new ones are continuously developed and published; together they cover the whole spectrum of tasks in language technology. DKPro Core contains wrappers for third-party tools as well as NLP components developed directly at the UKP Lab. It offers components for a number of application areas, such as linguistic preprocessing, information retrieval, and semantic text analytics, for English and German. DKPro Core builds on tools like uimaFIT, which allows fast and simple development of NLP pipelines. It is available under two license options: Apache Software License (ASL) Version 2 or GNU General Public License (GPL) Version 3; the GPL variant contains additional components.
Project / Company:
DKPro Core
Payment:
free of cost
Support:
Yes
License model:
GNU/GPL (General Public License)
Possibilities for modification:
Depending on the license model, the source code can be modified or extended.
Use of external software / services:
Depending on the license, DKPro Core uses tools and libraries from third-party providers.
Reuse:
DKPro Core is the foundation for various projects at the UKP Lab and is available to anyone interested.
Architecture (Text):
Apache UIMA offers a component-based architecture in which documents can be imported and analyzed by a configurable processing pipeline. DKPro Core offers a construction kit for harmonized processing components which are able to model a variety of analysis scenarios.
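The pipeline idea described above can be illustrated with a minimal, self-contained Java sketch. It deliberately does not use the Apache UIMA or uimaFIT APIs: the class PipelineSketch, its Document and Annotation types, and the two components are hypothetical stand-ins, assumed here only to show how independent components can add annotations to one shared document object in a configurable order.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Consumer;

// Illustrative only: a plain-Java model of a component pipeline.
// None of these names come from Apache UIMA, uimaFIT, or DKPro Core.
public class PipelineSketch {

    /** A typed span over the document text (a rough analogue of an annotation). */
    public static final class Annotation {
        public final String type;
        public final int begin;
        public final int end;

        public Annotation(String type, int begin, int end) {
            this.type = type;
            this.begin = begin;
            this.end = end;
        }
    }

    /** Shared container that every component reads from and writes to. */
    public static final class Document {
        public final String text;
        public final List<Annotation> annotations = new ArrayList<>();

        public Document(String text) {
            this.text = text;
        }

        public String coveredText(Annotation a) {
            return text.substring(a.begin, a.end);
        }
    }

    /** Hypothetical component: marks every whitespace-delimited run as a "Token". */
    public static void whitespaceTokenizer(Document doc) {
        int i = 0;
        int n = doc.text.length();
        while (i < n) {
            while (i < n && Character.isWhitespace(doc.text.charAt(i))) i++;
            int start = i;
            while (i < n && !Character.isWhitespace(doc.text.charAt(i))) i++;
            if (i > start) {
                doc.annotations.add(new Annotation("Token", start, i));
            }
        }
    }

    /** Hypothetical component: reuses existing "Token" spans and marks capitalized ones. */
    public static void capitalizedTagger(Document doc) {
        List<Annotation> found = new ArrayList<>();
        for (Annotation a : doc.annotations) {
            if (a.type.equals("Token") && Character.isUpperCase(doc.text.charAt(a.begin))) {
                found.add(new Annotation("Capitalized", a.begin, a.end));
            }
        }
        doc.annotations.addAll(found);
    }

    /** Runs the given components in order over a new document built from the text. */
    public static Document run(String text, List<Consumer<Document>> components) {
        Document doc = new Document(text);
        for (Consumer<Document> component : components) {
            component.accept(doc);
        }
        return doc;
    }

    public static void main(String[] args) {
        Document doc = run("DKPro Core builds on Apache UIMA",
                List.of(PipelineSketch::whitespaceTokenizer, PipelineSketch::capitalizedTagger));
        for (Annotation a : doc.annotations) {
            System.out.println(a.type + "\t" + doc.coveredText(a));
        }
    }
}
```

Because each component only depends on the annotation types already present in the document, components can be exchanged or reordered without changing the others, which is the property the harmonized component kit relies on.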
Author(s):
  • Michael Matuschek
Type:
  • Software
Application type(s) functional:
  • (collaborative) editing of publications and research data
  • (meta) search & browsing
Scope in e-publishing process:
  • (collaborative) editing of publications and research data
  • (meta) search & browsing
Digital object type(s):
  • documents
Application type(s) technical:
  • Program Library
Specific to area(s) of study:
  • Applied Linguistics (ddc:418)
  • Language, Linguistics (ddc:400)
Supported language(s):
  • (Freely configurable)
Technology format(s):
  • Apache UIMA
  • ASCII-Text (TXT)
  • HTML
  • uimaFIT
  • XML
Programming language(s):
  • Java

Last update:
30.07.2012 (14:07:29)
Creation date:
24.11.2011 (09:11:35)

This work or content is licensed under a Creative Commons Licence (CC BY-NC-SA 3.0).