koomri/text-segmentation

Implementation of the paper: Text Segmentation as a Supervised Learning Task

33
/ 100
Emerging

Implements supervised neural segmentation using sentence embeddings (max pooling strategy) trained on wiki-727K and Choi datasets with word2vec representations. Built on PyTorch with configurable model architectures, includes dataset preprocessing pipelines for Wikipedia dumps and evaluation metrics via segeval. Provides CLI tools for training, evaluation, and custom dataset generation with TensorBoard logging support.

265 stars. No commits in the last 6 months.

No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 1 / 25
Community 22 / 25

How are scores calculated?

Stars

265

Forks

57

Language

Python

License

Last pushed

Oct 02, 2019

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/koomri/text-segmentation"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.