castorini/hedwig
PyTorch deep learning models for document classification
Implements multiple neural architectures including DocBERT, Hierarchical Attention Networks, and character-level CNNs, with support for extreme multi-label classification tasks. Models leverage pre-trained word2vec embeddings and NLTK preprocessing, designed to work with benchmark datasets (Reuters, AAPD, IMDB) across document and sentence-level classification. Built on PyTorch 0.4 with modular architecture enabling direct comparison of different deep learning approaches for text classification.
596 stars. No commits in the last 6 months.
Stars
596
Forks
127
Language
Python
License
Apache-2.0
Category
Last pushed
Jul 21, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/castorini/hedwig"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related frameworks
davidberenstein1957/classy-classification
This repository contains an easy and intuitive approach to few-shot classification using...
inspirehep/magpie
Deep neural network framework for multi-label text classification
AnubhavGupta3377/Text-Classification-Models-Pytorch
Implementation of State-of-the-art Text Classification Models in Pytorch
kk7nc/Text_Classification
Text Classification Algorithms: A Survey
dayyass/text-classification-baseline
Pipeline for fast building text classification TF-IDF + LogReg baselines.