Kvasirs/MILES
MILES is a multilingual text simplifier inspired by LSBert - A BERT-based lexical simplification approach proposed in 2018. Unlike LSBert, MILES uses the bert-base-multilingual-uncased model, as well as simple language-agnostic approaches to complex word identification (CWI) and candidate ranking.
The system pipeline applies masked language modeling to generate simplification candidates, then ranks them using frequency-based scoring and optional fastText word embeddings for semantic similarity. It provides both a Flask web interface and CLI tools for single-sentence or batch file processing across 22 languages, with optional fine-tuning capabilities through custom fastText embeddings for improved accuracy in target languages.
No commits in the last 6 months.
Stars
50
Forks
7
Language
Python
License
—
Category
Last pushed
May 03, 2021
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/Kvasirs/MILES"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
zhihu/cuBERT
Fast implementation of BERT inference directly on NVIDIA (CUDA, CUBLAS) and Intel MKL
dimitreOliveira/bert-as-a-service_TFX
End-to-end pipeline with TFX to train and deploy a BERT model for sentiment analysis.
ThalesGroup/ConceptBERT
Implementation of ConceptBert: Concept-Aware Representation for Visual Question Answering
kpi6research/Bert-as-a-Library
Bert as a Library is a Tensorflow library for quick and easy training and finetuning of models...
Statistical-Impossibility/Feline-Project
Domain-adaptive NLP pipeline for feline veterinary NER using BERT