mich1803/Codenames-LLM
Building an AI team to play Codenames using top Large Language Models (LLMs), evaluating performance, and pitting them against each other. Explore their strategy and capabilities in this interactive competition!
No commits in the last 6 months.
Stars
2
Forks
2
Language
Jupyter Notebook
License
—
Category
Last pushed
Dec 27, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/mich1803/Codenames-LLM"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Featured in
Higher-rated alternatives
open-compass/opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral,...
IBM/unitxt
🦄 Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the...
lean-dojo/LeanDojo
Tool for data extraction and interacting with Lean programmatically.
GoodStartLabs/AI_Diplomacy
Frontier Models playing the board game Diplomacy.
MigoXLab/LMeterX
A general-purpose API load testing platform that supports LLM services and business HTTP...