thu-nics/C2C
[ICLR'26] The official code implementation for "Cache-to-Cache: Direct Semantic Communication Between Large Language Models"
Enables direct KV-Cache projection and fusion between LLMs using learnable transformer-based projectors, eliminating the need for text serialization. Supports multi-model setups where multiple "sharer" models can contribute cached representations to a single "receiver" model through layer-wise alignment. Provides the `RosettaModel` wrapper for seamless integration with any HuggingFace-compatible LLM, with pre-trained fusers available on HuggingFace Hub.
361 stars.
Stars
361
Forks
41
Language
Python
License
Apache-2.0
Category
Last pushed
Mar 05, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/thu-nics/C2C"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.