harmonydata/harmony
The Harmony Python library: a research tool for psychologists to harmonise data and questionnaire items. Open source.
Leverages large language models and spaCy for semantic matching of questionnaire items across languages, with PDF/Excel/Word parsing via Apache Tika. Provides both a Python library and web interface, supporting programmatic workflows through instrument creation from various file formats or string lists. Includes optional remote spaCy server deployment for lightweight installations, enabling scalable harmonisation workflows in research environments.
Stars
52
Forks
54
Language
Python
License
MIT
Category
Last pushed
Mar 12, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/harmonydata/harmony"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Featured in
Related tools
embeddings-benchmark/mteb
MTEB: Massive Text Embedding Benchmark
yannvgn/laserembeddings
LASER multilingual sentence embeddings as a pip package
embeddings-benchmark/results
Data for the MTEB leaderboard
MilaNLProc/honest
A Python package to compute HONEST, a score to measure hurtful sentence completions in language...
fresh-stack/freshstack
This repository helps you evaluate your models on the FreshStack benchmark!