To main content

Zootology - Toolkit for Semantic Data Enrichment and Reconciliation

Toolkit for Semantic Data Enrichment and Reconciliation

Contact person

Zootology is a collection of modular tools designed to support semantic data enrichment and reconciliation. Its core capabilities include:

  • Entity linking
  • Semantic typing
  • Ontology-based classification
  • Human-in-the-loop validation

Zootology can be integrated with others tools as services through APIs. The toolkit consists of the following tools.

Crocodile

Purpose: High‑performance entity linking engine for large tabular datasets.

Key features:

  • Row‑level parallel processing
  • Batch entity linking
  • Hybrid heuristics + ML ranking

GitHub: https://github.com/enRichMyData/crocodile

LionLinker

Purpose: Scalable enrichment engine for large datasets

Key features:

  • Context‑aware entity linking

  • Candidate retrieval from large KGs

  • Multi‑model re‑ranking

  • Confidence scoring + NIL detection

GitHub: https://github.com/enRichMyData/lion_linker

Moose

Purpose: Ontology‑driven classification & semantic typing.

Key features: 

  • Simplified semantic typing
  • Ontology‑based classification
  • NER‑type classification
  • Human‑interpretable taxonomy

GitHub: https://github.com/roby-avo/moose

Koala-UI

Purpose: Human‑in‑the‑loop interactive validation UI.

Key features:

  • Semantic annotation UI for tabular data
  • Visual inspection of entity linking
  • Column type annotation (NER / literal)
  • DPV‑based column classification

GitHub: https://github.com/enRichMyData/koala_ui

Explore research areas