allenai/bi-att-flow
Bi-directional Attention Flow (BiDAF) network is a multi-stage hierarchical process that represents context at different levels of granularity and uses a bi-directional attention flow mechanism to achieve a query-aware context representation without early summarization.
The implementation targets machine reading comprehension on the SQuAD dataset using TensorFlow r0.11, with GloVe embeddings and NLTK preprocessing. The model has roughly 2.5M parameters; training requires 12GB+ of GPU memory and converges at around 18k steps, with multi-GPU parallelization supported. Pre-trained weights are available via CodaLab for reproducible evaluation against the official metrics, reaching 77.3 F1 with a single model and 80.7 with an ensemble.
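The core idea of the attention flow layer can be illustrated with a small NumPy sketch. This is a simplified illustration, not the repository's TensorFlow code: it uses a plain dot-product similarity where the paper uses a trainable function alpha(h, u) = w^T[h; u; h∘u], and all array names here are illustrative.

```python
import numpy as np

def softmax(x, axis=-1):
    """Numerically stable softmax."""
    e = np.exp(x - np.max(x, axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def bidaf_attention(H, U):
    """Sketch of BiDAF's bi-directional attention flow.

    H: context encodings, shape (T, d)
    U: query encodings, shape (J, d)
    Returns the query-aware context representation G, shape (T, 4d).
    Similarity is simplified to a dot product (the paper learns it).
    """
    S = H @ U.T                                   # similarity matrix (T, J)
    # Context-to-query: each context word attends over the query words.
    a = softmax(S, axis=1)                        # (T, J)
    U_tilde = a @ U                               # (T, d) attended query vectors
    # Query-to-context: weight context words by their best query match.
    b = softmax(S.max(axis=1))                    # (T,)
    h_tilde = b @ H                               # (d,) single attended context vector
    H_tilde = np.tile(h_tilde, (H.shape[0], 1))   # broadcast to (T, d)
    # G = [H; U~; H∘U~; H∘H~] -- no early summarization of the context.
    return np.concatenate([H, U_tilde, H * U_tilde, H * H_tilde], axis=1)
```

Note that the context representation H is carried through to G unchanged alongside the attended vectors, which is what "without early summarization" refers to: attention is computed at every time step rather than collapsing the context into a single vector.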
1,540 stars. No commits in the last 6 months.
Stars: 1,540
Forks: 672
Language: Python
License: Apache-2.0
Category: NLP
Last pushed: May 31, 2023
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/allenai/bi-att-flow"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
carpedm20/MemN2N-tensorflow
"End-To-End Memory Networks" in Tensorflow
domluna/memn2n
End-To-End Memory Network using Tensorflow
HKUST-KnowComp/R-Net
Tensorflow Implementation of R-Net
localminimum/QANet
A Tensorflow implementation of QANet for machine reading comprehension
YerevaNN/Dynamic-memory-networks-in-Theano
Implementation of Dynamic memory networks by Kumar et al. http://arxiv.org/abs/1506.07285