FastText Serving Wrappers NLP Tools

Language-agnostic wrappers, bindings, and HTTP servers for deploying and serving fastText models across different platforms and runtimes. Does NOT include fastText model training, other text classification libraries, or general embedding services.

There are 38 fasttext serving wrappers tools tracked. 2 score above 50 (established tier). The highest-rated is ChenghaoMou/text-dedup at 69/100 with 746 stars and 695 monthly downloads. 1 of the top 10 are actively maintained.

Get all 38 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=nlp&subcategory=fasttext-serving-wrappers&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Tool Score Tier
1 ChenghaoMou/text-dedup

All-in-one text de-duplication

69
Established
2 loretoparisi/fasttext.js

FastText for Node.js

62
Established
3 winkjs/wink-jaro-distance

An Implementation of Jaro Distance Algorithm by Matthew A. Jaro

49
Emerging
4 gagan3012/PolyDeDupe

PolyDeDupe: Multi-Lingual Data Deduplication

49
Emerging
5 messense/fasttext-serving

fastText model serving service

49
Emerging
6 vrasneur/pyfasttext

Yet another Python binding for fastText

45
Emerging
7 shner-elmo/flashtext2

The fastest FlashText library for Python

44
Emerging
8 olegtarasov/FastText.NetWrapper

.NET Standard wrapper for fastText library. Now works on Windows, Linux and MacOs!

43
Emerging
9 oscar-project/ungoliant

:spider: The pipeline for the OSCAR corpus

41
Emerging
10 jazzyarchitects/fasttext-node

Node wrapper around FastText Library

40
Emerging
11 vunb/node-fasttext

Nodejs binding for fasttext representation and classification.

35
Emerging
12 aplmikex/deduplication_mnbvc

文本去重

34
Emerging
13 waltsmith88/go-flashtext

Go-flashtext is a flashtext implement written in Go (Golang). It is based on...

31
Emerging
14 davidmenger/fast-text

Prediction and nearest neighbour tools from Facebook Fast Text wrapped into...

30
Emerging
15 jianlins/FastContext

FastContext is an optimized Java implementation of ConText algorithm...

29
Experimental
16 go-air/dupi

A tool to find all duplicates in large sets of text documents.

28
Experimental
17 Edgaras0x4E/StrSim

Collection of string similarity and distance algorithms in PHP including...

28
Experimental
18 oscar-project/goclassy

An asynchronous concurrent pipeline for classifying Common Crawl based on...

27
Experimental
19 soaxelbrooke/phrase

A tool for learning significant phrase/term models, and efficiently labeling...

26
Experimental
20 shner-elmo/flashtext2-rs

Flashtext implementation in Rust

26
Experimental
21 raypereda/shuffle

a tool for shuffling lines of text

24
Experimental
22 yunsii/fasttext.wasm.js

Node and Browser env supported WebAssembly version of fastText: Library for...

23
Experimental
23 siara-cc/FastText_Lang_bindings

Wrappers for FastText Library used for fast text representation and classification.

22
Experimental
24 loretoparisi/fasttext.py

FastText Pytorch version

21
Experimental
25 1712n/dedup-service

A high-performance service designed to eliminate duplicate and...

21
Experimental
26 mabdh/go-fasttext

🗚🐀 serving fastText model with golang

20
Experimental
27 rhnvrm/textsimilarity

go package that provides similarity between two string documents using...

19
Experimental
28 proycon/sesdiff

Generates a shortest edit script (Myers' diff algorithm) to indicate how to...

17
Experimental
29 karmdesai/fastTextWeb

fastTextWeb is a custom version of Facebook's text classification library...

16
Experimental
30 unhammer/fastText-haskell

haskell bindings to fastText

15
Experimental
31 mcthulhu1/go-text-router

Text classification HTTP service in Go — TF-IDF + Naive Bayes + routing

14
Experimental
32 mtnmunuklu/lescatit

Provides to crawl and categorize URL addresses

14
Experimental
33 PyDataBlog/SimString.jl

Native Julia implementation of CPMerge (SimString) algorithm

13
Experimental
34 eddielin0926/cjkfuzz

CJKfuzz is a Python library for supporting fuzzy matching chinese string.

12
Experimental
35 duranbe/lev

Levenshtein distance function as C Extension for Python 3

12
Experimental
36 4AI/generative_deduplication

Code for Generative Deduplication For Socia Media Data Selection (Findings...

12
Experimental
37 innerNULL/osimhash

A deduplication lib built Over [SIMHASH](https://github.com/yanyiwu/simhash).

11
Experimental
38 livelace/girie

girie ("go" + "kirie") is a tool for data/metadata extraction from web pages.

10
Experimental