Question Answering Systems Embedding Tools

Tools for building QA systems that retrieve answers from text, knowledge bases, or documents using semantic search and embeddings. Does NOT include general conversational chatbots, LLM APIs, or fact-checking as a standalone task.

There are 24 question answering systems tools tracked. 1 score above 70 (verified tier). The highest-rated is docarray/docarray at 76/100 with 3,117 stars and 145,810 monthly downloads.

Get all 24 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=embeddings&subcategory=question-answering-systems&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Tool Score Tier
1 docarray/docarray

Represent, send, store and search multimodal data

76
Verified
2 primeqa/primeqa

The prime repository for state-of-the-art Multilingual Question Answering...

50
Established
3 CogStack/CogStack-Pipeline

Distributed, fault tolerant batch processing for Natural Language...

35
Emerging
4 danielfrees/scrapemed

ScrapeMed: Data scraping for PubMed Central.

33
Emerging
5 ekatraone/Mobius-v1

Ekatra QnA is a student-focused intelligent search engine that enables them...

28
Experimental
6 algoprog/Quin

An easy to use framework for large-scale fact-checking and question answering

28
Experimental
7 ML-Recipes/BERT-FAQ

Python-based toolkit for building and evaluating a transformer-based FAQ...

23
Experimental
8 achimoraites/simple-question-answering-ml-system

A simple question and answering system with semantic search

23
Experimental
9 AniruddhaPKawarase/agentic-ingestion-pipeline

Document chunking, embedding & FAISS indexing pipeline with resume-on-failure

22
Experimental
10 pyZac/6fe-ai-bot

NLP data pipeline for semantic knowledge retrieval | Arabic text processing...

16
Experimental
11 HamBa-m/sci-scraper

A Python-based web scraping tool for collecting scientific literature from...

16
Experimental
12 abhishek-kathuria/Question-Answering-using-Capsule-Networks

A novel system for the automated question & answering of product-related...

15
Experimental
13 lea-dieudonat/financial_extraction

An NLP & ontology-based pipeline that makes large collections of financial...

14
Experimental
14 manjul5x/dataops-kb

Spark pipeline + AI-powered knowledge base for Airflow error resolution —...

14
Experimental
15 niteshredddy/ai_dietitian

NutriVision Pro: A multimodal AI dietitian featuring YOLOv11 vision, Neural...

14
Experimental
16 KaifAhmad1/Awesome-NLP-and-IR

A comprehensive resource for NLP and IR enthusiasts, covering topics from...

12
Experimental
17 nava2105/soce_scraper

soce_scraper is a web application designed to extract procurement data from...

12
Experimental
18 Hardhika05/finance-ai-platform

AI-powered financial knowledge system with real-time data ingestion,...

11
Experimental
19 fabri44k/Answer-with-web

Use an LLM and web content to answer a user's question.

11
Experimental
20 EndlessReform/SphinxLM

A general assistant for query inversion

11
Experimental
21 niranjanxprt/frequenz-ai-native-task

AI-powered knowledge extraction and visualization system with semantic...

11
Experimental
22 jameswniu/nlp-clinical-trial-eligibility-screening

A modular system for evaluating patient eligibility against clinical trial...

11
Experimental
23 jo-valer/fact-checking-ita-abstention

Official repository of our AILC CLiC-it 2023 paper When You Doubt, Abstain:...

10
Experimental
24 MondecaLabs/FactTriplesChecker

Fact triples validator work made for the ISWC 2019 challenge - Task 1

10
Experimental