SomanthaManuranga/QuantumCircuit-Dataset-Builder-Automated-Image-to-Text-Pipeline
This project builds an automated pipeline to generate a high-quality dataset of quantum circuit diagrams and metadata from arXiv “quant-ph” papers. It addresses the near-zero accuracy of general image-to-text models on specialized technical diagrams by enabling domain-specific training data.
Stars
1
Forks
—
Language
Jupyter Notebook
License
—
Category
Last pushed
Feb 10, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/SomanthaManuranga/QuantumCircuit-Dataset-Builder-Automated-Image-to-Text-Pipeline"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
fidelity/textwiser
[AAAI 2021] TextWiser: Text Featurization Library
RandolphVI/Multi-Label-Text-Classification
About Muti-Label Text Classification Based on Neural Network.
ThilinaRajapakse/pytorch-transformers-classification
Based on the Pytorch-Transformers library by HuggingFace. To be used as a starting point for...
xuyige/BERT4doc-Classification
Code and source for paper ``How to Fine-Tune BERT for Text Classification?``
allenai/scibert
A BERT model for scientific text.