Question-Answering Systems NLP Tools

Datasets, benchmarks, and frameworks for building question answering systems across modalities (open-domain, reading comprehension, commonsense, multilingual). Does NOT include general machine translation, information retrieval, or dialogue systems.

There are 65 question-answering systems tools tracked. 1 score above 50 (established tier). The highest-rated is PaddlePaddle/RocketQA at 54/100 with 785 stars and 22 monthly downloads.

Get all 65 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=nlp&subcategory=question-answering-systems&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Tool Score Tier
1 PaddlePaddle/RocketQA

🚀 RocketQA, dense retrieval for information retrieval and question...

54
Established
2 allenai/deep_qa

A deep NLP library, based on Keras / tf, focused on question answering (but...

44
Emerging
3 worldbank/iQual

iQual is a package that leverages natural language processing to scale up...

44
Emerging
4 shuaihuaiyi/QA

使用深度学习算法实现的中文问答系统

44
Emerging
5 seriousran/awesome-qa

😎 A curated list of the Question Answering (QA)

40
Emerging
6 fhamborg/Giveme5W1H

Extraction of the journalistic five W and one H questions (5W1H) from news...

40
Emerging
7 mandarjoshi90/triviaqa

Code for the TriviaQA reading comprehension dataset

38
Emerging
8 programmer290399/pyqna

A simple python package for question answering.

38
Emerging
9 huggingface/node-question-answering

Fast and production-ready question answering in Node.js

37
Emerging
10 TheHamkerCat/python-arq

Asynchronous Python Wrapper For A.R.Q API.

37
Emerging
11 21han/nlp_qa_project

Natural Language Processing Question Answering Final Project

37
Emerging
12 Karthik-Bhaskar/Context-Based-Question-Answering

Context-Based-Question-Answering

35
Emerging
13 UKP-SQuARE/square-core

SQuARE: Software for question answering research.

35
Emerging
14 seominjoon/denspi

Real-Time Open-Domain Question Answering with Dense-Sparse Phrase Index (DenSPI)

35
Emerging
15 TmaxEdu/KorDPR

This repo Implements "Dense Passage Retrieval for Open-Domain Question...

35
Emerging
16 allenai/semanticilp

Question Answering as Global Reasoning over Semantic Abstractions (AAAI-18)

34
Emerging
17 dice-group/TeBaQA

A question answering system which utilises machine learning.

34
Emerging
18 neuml/tldrstory

📊 Semantic search for headlines and story text

33
Emerging
19 BDBC-KG-NLP/QA-Survey-CN

北京航空航天大学大数据高精尖中心自然语言处理研究团队开展了智能问答的研究与应用总结。包括基于知识图谱的问答(KBQA),基于文本的问答系统(TextQA)...

33
Emerging
20 CogComp/multirc

Reasoning over Multiple Sentences (Multi-RC)

33
Emerging
21 ARBML/qawafi

Platform for Arabic Poetry Analysis using knowledge-based and deep learning...

33
Emerging
22 apple/ml-mkqa

We introduce MKQA, an open-domain question answering evaluation set...

33
Emerging
23 Chia-Hsuan-Lee/KaggleDBQA

Introduction page of a challenging text-to-SQL dataset: KaggleDBQA

30
Emerging
24 anassinator/markov-sentence-correction

Markov Chains and Hidden Markov Models to generate and correct sentences

30
Emerging
25 IBM/sciqa-arcade198-dataset

ARCADE198 Dataset from the ACL 2018 MRQA Workshop

30
Emerging
26 soco-ai/SF-QA

Evaluation framework for open-domain question answering.

30
Emerging
27 stanford-oval/schema2qa

Schema2QA Question Answering Dataset

27
Experimental
28 AskNowQA/QA-Tutorial

The repo contains all the materials related to Question Answering.

27
Experimental
29 RDTvlokip/InfiniQA

The Official InfiniQA Dataset 📁📝

26
Experimental
30 Mukhopadhyay/Amazon_QnA_Dataset

Amazon question/answer dataset.

26
Experimental
31 siddharthkhincha/Inter-IIT-11-Devrev

IIT Guwahati's Gold Medal winning solution to DevRev’s Expert Answers in a...

25
Experimental
32 pln-fing-udelar/newsqa-es

Code to rebuild the NewsQA-es dataset: a Spanish version of the NewsQA dataset

23
Experimental
33 I-QA-UCT/IQA

Extensions to Yuan et al. QAit task.

23
Experimental
34 scruel/campusQA

Deeplearning4J框架搭建的第一个问答小AI

23
Experimental
35 google-research-datasets/query-wellformedness

25,100 queries from the Paralex corpus (Fader et al., 2013) annotated with...

23
Experimental
36 hasanhuz/MentalQA

MentalQA: An Annotated Arabic Corpus for Questions and Answers of Mental Healthcare

23
Experimental
37 christianbitter/QA_and_QG

An inventory of data sets around Question Generation and Question Answering

22
Experimental
38 boostcampaitech3/level2-mrc-level2-nlp-09

네이버 부스트캠프 | Open-Domain Question Answering(ODQA)

21
Experimental
39 Chia-Hsuan-Lee/ODSQA

ODSQA: OPEN-DOMAIN SPOKEN QUESTION ANSWERING DATASET

21
Experimental
40 di37/question-answering-api-llm

Question Answering System API based on all of the Harry Potter Books that...

21
Experimental
41 boostcampaitech3/final-project-level3-nlp-09

네이버 부스트캠프 | 회의록을 활용한 Closed-Domain Question Answering(CDQA)

20
Experimental
42 reddrex/lingcomp_QA

An Spanish computational linguistics QA corpus (JSON format) with 1004 rows

19
Experimental
43 donderom/sqwat

TUI editor for the Stanford Question Answering Dataset (SQuAD) 💬

19
Experimental
44 ZhiyunLab/CsQA

CommonsenseQA

19
Experimental
45 youngerous/Open-domain-QA

Presentation slides of ODQA

19
Experimental
46 boostcampaitech2/mrc-level2-nlp-09

[2nd] KLUE Open-Domain Question Answering

19
Experimental
47 louisowen6/quora_paraphrasing_id

Quora Paraphrasing Dataset Bahasa Indonesia Version

18
Experimental
48 spapicchio/QATCH

Official implementation of QATCH: Benchmarking SQL-centric tasks with Table...

18
Experimental
49 AkariAsai/unanswerable_qa

The official implementation for ACL 2021 "Challenges in Information Seeking...

16
Experimental
50 orsiluk/Answer-Ranking

Model to find relevant answers to questions on CQA (Community Question...

16
Experimental
51 SockAndSandal/semantic-search-qa

Code for the Semantic Search QA Algorithm

15
Experimental
52 Dalia-Mahmoud-ElSayes/Gp-2022-ma3aref-Arabic-QA

Our Graduation project: "Ma'aref" an Arabic Question Answering on Quran and Fatwa.

15
Experimental
53 Wikidepia/SQuAD-id

Stanford Question Answering Dataset Translated to Indonesia.

15
Experimental
54 Sparshjain25/SQuAD-2.0

NLP Team 12

15
Experimental
55 motazsaad/Quran-QA

Quran QA

14
Experimental
56 gsh199449/productqa

Product-Aware Answer Generation in E-Commerce Question-Answering

14
Experimental
57 santhoshtr/wq

An experimental natural language based querying system for Wikipedia

14
Experimental
58 asaparov/fictionalgeoqa

Question-answering dataset to evaluate reasoning ability over short paragraphs.

13
Experimental
59 svjack/tableQA-Chinese

Unsupervised tableQA and databaseQA on chinese finance question and tabular data

12
Experimental
60 ASoleimaniB/NLQuAD

NLQuAD: A Non-Factoid Long Question Answering Data Set. To be published at EACL2021

12
Experimental
61 aklein4/ASKiT

Stanford CS224N Final Project. A text-based multi-hop reasoning...

12
Experimental
62 mkearney/infoquality

Information Quality

11
Experimental
63 felixgiov/UDST-DurationQA

Dataset from the paper "Improving Event Duration Question Answering by...

10
Experimental
64 GUT-AI/qa

Question Answering (QA)

10
Experimental
65 lucadiliello/asnq-challenging

ASNQ without trivial negative answers.

10
Experimental