muelletm edited this page Mar 16, 2015 · 1 revision

Cistern is the principal repository of tools and resources released by the Center for Information and Language Processing ([http://www.cis.uni-muenchen.de/ CIS]) of the University of Munich ([http://www.uni-muenchen.de/ LMU]).

The CIS conducts research on linguistically informed statistical natural language processing ([http://en.wikipedia.org/wiki/Natural_language_processing NLP]), including problems such as part-of-speech tagging, parsing, and sentiment analysis.

== CIS Tools ==

  • [CoSimRank CoSimRank] - a fast and accurate graph-based similarity measure
  • [HMMLA HMMLA] - an implementation of Hidden Markov Models with Latent Annotations
  • [marmot MarMoT] - a fast and accurate morphological tagger
  • [marlin MarLiN] - a fast word clustering tool
  • [Ocrocis Ocrocis] - a project manager for the OCR toolkit Ocropy by Thomas Breuel
  • [SFST SFST] - a finite state transducer toolkit
  • [SMOR SMOR] - a German computational morphology

== CIS Resources ==

  • [CoreferenceChains CoreferenceChains] - automatically extracted coreference chains from English Gigaword data
  • [Antonyms Antonyms] - a list of word-antonym pairs
  • [robusttagging Robust Tagging] - resources for robust morphological tagging