songlab-cal/tape

Tasks Assessing Protein Embeddings (TAPE), a set of five biologically relevant semi-supervised learning tasks spread across different domains of protein biology.

43
/ 100
Emerging

Provides pretrained protein language models (BERT-style Transformer, UniRep) integrated with HuggingFace transformers API for easy model loading and inference. Includes a modular benchmark suite with five downstream tasks—secondary structure prediction, contact prediction, remote homology detection, fluorescence, and stability prediction—plus utilities for generating sequence embeddings via `tape-embed` command with automatic GPU distribution. The codebase migrated from TensorFlow to PyTorch but recommends external frameworks (PyTorch Lightning, Fairseq) for training rather than native training utilities.

733 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 9 / 25
Community 24 / 25

How are scores calculated?

Stars

733

Forks

133

Language

Python

License

BSD-3-Clause

Last pushed

Dec 11, 2022

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/songlab-cal/tape"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.