Kvasirs/MILES

MILES is a multilingual text simplifier inspired by LSBert - A BERT-based lexical simplification approach proposed in 2018. Unlike LSBert, MILES uses the bert-base-multilingual-uncased model, as well as simple language-agnostic approaches to complex word identification (CWI) and candidate ranking.

/ 100

Experimental

The system pipeline applies masked language modeling to generate simplification candidates, then ranks them using frequency-based scoring and optional fastText word embeddings for semantic similarity. It provides both a Flask web interface and CLI tools for single-sentence or batch file processing across 22 languages, with optional fine-tuning capabilities through custom fastText embeddings for improved accuracy in target languages.

No commits in the last 6 months.

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 8 / 25

Maturity 8 / 25

Community 13 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

—

Higher-rated alternatives

zhihu/cuBERT

Fast implementation of BERT inference directly on NVIDIA (CUDA, CUBLAS) and Intel MKL

dimitreOliveira/bert-as-a-service_TFX

End-to-end pipeline with TFX to train and deploy a BERT model for sentiment analysis.

ThalesGroup/ConceptBERT

Implementation of ConceptBert: Concept-Aware Representation for Visual Question Answering

kpi6research/Bert-as-a-Library

Bert as a Library is a Tensorflow library for quick and easy training and finetuning of models...

Statistical-Impossibility/Feline-Project

Domain-adaptive NLP pipeline for feline veterinary NER using BERT

Explore ML Frameworks

All categories Trending ML Framework directory Insights