zzy99/epidemic-sentence-pair
天池 疫情相似句对判定大赛 线上第一名方案
ArchivedImplements semantic similarity matching for pandemic-related medical queries using ensemble methods combining BERT-wwm-ext, ERNIE-1.0, and RoBERTa-large-pair with k-fold cross-validation. The approach incorporates adversarial training on embedding layers, symmetric/transitive data augmentation, and sigmoid-space probability averaging to improve discrimination between paraphrased vs. semantically distinct question pairs. Additionally uses pseudo-labeling and threshold tuning (0.47) optimized on domain-specific data from Tianchi competition containing ~10K real clinical question pairs.
435 stars. No commits in the last 6 months.
Stars
435
Forks
75
Language
Python
License
—
Category
Last pushed
Oct 17, 2020
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/zzy99/epidemic-sentence-pair"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
ShawnyXiao/2017-CCF-BDCI-AIJudge
2017-CCF-BDCI-让AI当法官(初赛):7th/415 (Top 1.68%)
ShawnyXiao/2018-DC-DataGrand-TextIntelProcess
2018-DC-“达观杯”文本智能处理挑战赛:冠军 (1st/3131)
beader/tianchi_nl2sql
追一科技首届中文NL2SQL挑战赛决赛第3名方案+代码
rogeroyer/2019-CCF-BDCI-Finance-Information-Negative-Judgment
top1-solution
zhanzecheng/SOHU_competition
Sohu's 2018 content recognition competition 1st solution(搜狐内容识别大赛第一名解决方案)