Kalebu/Plagiarism-checker-Python
A python project for checking plagiarism of documents based on cosine similarity
Converts raw text documents into numerical vectors using TF-IDF-style transformations, then computes pairwise cosine similarity scores to identify duplicate or near-duplicate documents. Automatically discovers and processes all `.txt` files in the project directory, outputting similarity tuples for each document pair. Includes a companion library ([Pysimilar](https://github.com/Kalebu/pysimilar)) for simplified string comparison without manual vectorization.
325 stars. No commits in the last 6 months.
Stars
325
Forks
174
Language
Python
License
—
Category
Last pushed
Aug 05, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/Kalebu/Plagiarism-checker-Python"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
KOKOSde/localmod
Self-hosted content moderation API that outperforms Amazon Comprehend. 100% offline, your data...
ogulcanaydogan/AI-Provenance-Tracker
Open-source multi-modal AI content detection platform, analyses text, images, audio, and video...
credo-ai/credoai_lens
Credo AI Lens is a comprehensive assessment framework for AI systems. Lens standardizes model...
jina-ai/example-app-store
App store search example, using Jina as backend and Streamlit as frontend
Alex0Blackwell/bias-monitor
A Chrome Extension that promotes politically diverse news reading with Artificial Intelligence!