The-Swarm-Corporation/DPO-MCTS-ToT-Training

This module implements a post-training mechanism that lets a language model explore multiple reasoning branches (chains of thought) within a Monte Carlo Tree Search (MCTS) framework. It selects the best branch using a cosine similarity evaluator that scores each candidate answer against a known correct answer.
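The selection step described above can be illustrated with a minimal sketch. This is not the repository's code: the function names (`bag_of_words`, `cosine_similarity`, `select_best_branch`) are hypothetical, and a simple word-count vector stands in for whatever embedding the project actually uses. The MCTS expansion itself is omitted; the sketch only shows how candidate answers from different branches might be ranked against a reference answer.

```python
import math
import re
from collections import Counter


def bag_of_words(text: str) -> Counter:
    """Lowercase the text and count word tokens (an assumed stand-in for a real embedding)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine_similarity(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors; 0.0 if either is empty."""
    dot = sum(count * b[token] for token, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def select_best_branch(candidates: list[str], reference: str) -> tuple[str, float]:
    """Return the candidate answer most similar to the reference, with its score."""
    ref_vec = bag_of_words(reference)
    scored = [(c, cosine_similarity(bag_of_words(c), ref_vec)) for c in candidates]
    return max(scored, key=lambda pair: pair[1])


if __name__ == "__main__":
    # Hypothetical candidate answers, one per explored reasoning branch.
    branches = [
        "The answer is 12",
        "Paris is the capital of France",
        "I do not know",
    ]
    best, score = select_best_branch(branches, "The capital of France is Paris")
    print(best, round(score, 3))
```

In a full MCTS loop, a score like this would presumably serve as the reward propagated back up the tree; here it is used only once, to pick among finished branches.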

/ 100

Experimental

No commits in the last 6 months.

Stale 6m · No Package · No Dependents

Maintenance 0 / 25

Adoption 4 / 25

Maturity 9 / 25

Community 8 / 25

Language: Python

License: MIT

Category: llm-framework-abstractions

Last pushed: Feb 11, 2025

LLM Framework Abstractions · 139 agents

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/agents/The-Swarm-Corporation/DPO-MCTS-ToT-Training"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.

Higher-rated alternatives

InfinitiBit/graphbit

GraphBit is the world’s first enterprise-grade Agentic AI framework, built on a Rust core with a...

autogluon/autogluon-assistant

Multi-Agent System Powered by LLMs for End-to-end Multimodal ML Automation

samholt/L2MAC

🚀 The LLM Automatic Computer Framework: L2MAC

pguso/agents-from-scratch

Build AI agents from first principles using a local LLM - no frameworks, no cloud APIs, no...

wjayesh/mahilo

mahilo: Multi-Agent Human-in-the-Loop Framework is a flexible framework for creating multi-agent...
