Zootology is a collection of modular tools designed to support semantic data enrichment and reconciliation. Its core capabilities include:
- Entity linking
- Semantic typing
- Ontology-based classification
- Human-in-the-loop validation
Zootology can be integrated with other tools as services through APIs. The toolkit consists of the following tools.
Crocodile
Purpose: High‑performance entity linking engine for large tabular datasets.
Key features:
- Row‑level parallel processing
- Batch entity linking
- Hybrid heuristics + ML ranking
GitHub: https://github.com/enRichMyData/crocodile
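The combination of batch entity linking and row-level parallel processing listed above can be sketched roughly as follows. This is a minimal illustration and not Crocodile's actual API: `TOY_KB`, `link_row`, `batches`, and `link_table` are hypothetical names, the entity IDs are placeholders, and the dictionary lookup stands in for real candidate retrieval and ranking.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Dict, Iterator, List, Optional

Row = Dict[str, Optional[str]]

# Hypothetical in-memory lookup standing in for a knowledge graph index.
TOY_KB = {"rome": "E1", "paris": "E2"}


def link_row(row: Row) -> Row:
    """Link the 'city' cell of a single row; unmatched mentions map to None."""
    mention = (row["city"] or "").strip().lower()
    return {**row, "entity": TOY_KB.get(mention)}


def batches(rows: List[Row], size: int) -> Iterator[List[Row]]:
    """Yield consecutive slices of at most `size` rows."""
    for start in range(0, len(rows), size):
        yield rows[start:start + size]


def link_table(rows: List[Row], batch_size: int = 2, workers: int = 4) -> List[Row]:
    """Process the table batch by batch, linking the rows of each batch in parallel."""
    linked: List[Row] = []
    with ThreadPoolExecutor(max_workers=workers) as pool:
        for batch in batches(rows, batch_size):
            # pool.map preserves input order, so output rows stay aligned with input rows.
            linked.extend(pool.map(link_row, batch))
    return linked


table = [{"city": "Rome"}, {"city": "Paris "}, {"city": "Atlantis"}]
result = link_table(table)
```

Threads suit this sketch because a real linker would spend most of its time waiting on index or service lookups; a CPU-bound ranking step would more likely call for processes.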
LionLinker
Purpose: Scalable enrichment engine for large datasets.
Key features:
- Context‑aware entity linking
- Candidate retrieval from large KGs
- Multi‑model re‑ranking
- Confidence scoring + NIL detection
GitHub: https://github.com/enRichMyData/lion_linker
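As an illustration of confidence scoring with NIL detection, the sketch below normalizes candidate ranking scores with a softmax and returns no entity (NIL) when the best candidate's confidence falls under a threshold. It is one common way to implement the idea, not necessarily how LionLinker does it: `pick_entity`, the `0.6` threshold, and the candidate IDs are assumptions made for the example.

```python
import math
from typing import List, Optional, Tuple

Candidate = Tuple[str, float]  # (entity ID, raw ranking score)


def softmax(scores: List[float]) -> List[float]:
    """Turn raw scores into probabilities; subtracting the max avoids overflow."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def pick_entity(
    candidates: List[Candidate], threshold: float = 0.6
) -> Tuple[Optional[str], float]:
    """Return (entity ID, confidence), or (None, confidence) when the mention is NIL."""
    if not candidates:
        return None, 0.0  # nothing retrieved: treat the mention as NIL
    probs = softmax([score for _, score in candidates])
    best = max(range(len(probs)), key=probs.__getitem__)
    confidence = probs[best]
    if confidence < threshold:
        return None, confidence  # best candidate is not convincing enough
    return candidates[best][0], confidence


clear_case = pick_entity([("E1", 5.0), ("E2", 1.0)])
ambiguous_case = pick_entity([("E1", 1.0), ("E2", 1.0)])
```

In `clear_case` one candidate dominates and is returned; in `ambiguous_case` two equally scored candidates split the probability mass, so the mention is reported as NIL.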
Moose
Purpose: Ontology‑driven classification & semantic typing.
Key features:
- Simplified semantic typing
- Ontology‑based classification
- NER‑type classification
- Human‑interpretable taxonomy
GitHub: https://github.com/roby-avo/moose
Koala-UI
Purpose: Human‑in‑the‑loop interactive validation UI.
Key features:
- Semantic annotation UI for tabular data
- Visual inspection of entity linking
- Column type annotation (NER / literal)
- DPV‑based column classification