NatLibFi/Annif
Annif is a multi-algorithm automated subject indexing tool for libraries, archives and museums.
It supports pluggable machine learning backends (fastText, Omikuji, neural networks) and multiple NLP analyzers, and is exposed through a REST API and a web UI for flexible deployment. Built in Python, its modular architecture allows custom algorithm combinations and integration with controlled vocabularies (e.g., Finto thesauri). Models are trained on library metadata corpora, and the tool is actively benchmarked in shared evaluation challenges for subject indexing.
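As a rough illustration of the REST API mentioned above, the sketch below builds (but does not send) a subject-suggestion request. It assumes a suggest endpoint of the form /v1/projects/<project_id>/suggest that accepts form-encoded text, limit and threshold fields; the base URL, the project id "my-project" and the helper name build_suggest_request are hypothetical, so check the Annif API documentation for the version you run.

```python
import urllib.parse
import urllib.request


def build_suggest_request(base_url, project_id, text, limit=10, threshold=0.0):
    """Build a POST request for subject suggestions.

    Assumption: the endpoint path and the form field names (text, limit,
    threshold) follow the Annif REST API as commonly documented; they are
    not taken from this page.
    """
    # Form-encode the document text and the result-filtering parameters.
    form = urllib.parse.urlencode(
        {"text": text, "limit": limit, "threshold": threshold}
    ).encode("utf-8")
    # Hypothetical project id is URL-quoted in case it contains odd characters.
    url = (
        f"{base_url.rstrip('/')}/v1/projects/"
        f"{urllib.parse.quote(project_id)}/suggest"
    )
    return urllib.request.Request(url, data=form, method="POST")
```

Passing the returned object to urllib.request.urlopen would send the request to a running Annif instance; the response is expected to be JSON, but its exact fields are not described on this page.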
253 stars and 312 monthly downloads. Available on PyPI.
Stars: 253
Forks: 45
Language: Python
License: (not listed)
Category: (not listed)
Last pushed: Mar 03, 2026
Monthly downloads: 312
Commits (30d): 0
Dependencies: 17
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/NatLibFi/Annif"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
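The same request can be sketched in Python using only the standard library. This is a minimal example under stated assumptions: the URL segments are read as category/owner/repository based on the curl command above, the response body is assumed to be JSON, and the function names quality_url and fetch_quality are hypothetical.

```python
import json
import urllib.request

# Base path taken from the curl example; the segment meaning is an assumption.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category, owner, repo):
    """Build the endpoint URL following the pattern of the curl example."""
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category, owner, repo, timeout=10.0):
    """Fetch the quality data and decode it, assuming a JSON response body."""
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling fetch_quality("nlp", "NatLibFi", "Annif") would perform the network request; with the keyless limit of 100 requests/day, it is worth caching the result rather than calling it in a loop.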
Related tools
explosion/displacy
💥 displaCy.js: An open-source NLP visualiser for the modern web
microsoft/browsecloud
A web app to create and browse text visualizations for automated customer listening.
microsoft/VisTalk
A JavaScript toolkit for Natural Language-based Visualization Authoring
DeNeutoy/spacy-vis
A visualisation tool for spaCy using Hierplane.
impresso/impresso-frontend
🚀 The frontend application of the Impresso WebApp