suraj5424/Protein-sequence-analysis
This notebook does the analysis of a protein sequence dataset in FASTA format. It addresses challenges in data import, alignment, embedding, and classification using bioinformatics tools and machine learning techniques. The solution provides a systematic approach to extract insights from structured data, crucial for bioinformatics research.
No commits in the last 6 months.
Stars
3
Forks
—
Language
Jupyter Notebook
License
—
Category
Last pushed
Jun 14, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/suraj5424/Protein-sequence-analysis"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
DeepChainBio/bio-transformers
bio-transformers is a wrapper on top of the ESM/Protbert model, trained on millions on proteins...
Rostlab/bindPredict
Prediction of binding residues for metal ions, nucleic acids, and small molecules.
bio-ontology-research-group/predCAN
Ontology-based prediction of cancer driver genes
MI2DataLab/memr
R package for Multisource Embeddings for Medical Records
KCLabMTU/LMSuccSite
Improving Protein Succinylation Sites Prediction Using Features Extracted from Protein Language Model