aahouzi/llama2-chatbot-cpu

A LLaMA2-7b chatbot with memory that runs on CPU and is optimized using smooth quantization, 4-bit quantization, or Intel® Extension for PyTorch with bfloat16.

/ 100

Experimental

No commits in the last 6 months.

Stale 6m · No Package · No Dependents

Maintenance 0 / 25

Adoption 6 / 25

Maturity 9 / 25

Community 0 / 25

Stars

Forks

—

Language: Python

License: MIT

Category: messaging-platform-chatbots

Last pushed: Feb 27, 2024

Commits (30d)

Messaging Platform Chatbots · 75 models

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/aahouzi/llama2-chatbot-cpu"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
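The same request can be made from Python. The following is a minimal sketch using only the standard library. It assumes the three path segments after `/quality/` are a directory name, a repository owner, and a repository name (inferred from the single curl example above), and that the endpoint returns JSON; neither is confirmed by this page. The function names `quality_url` and `fetch_quality` are hypothetical helpers, not part of any published client.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example on this page.
API_BASE = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(directory: str, owner: str, repo: str) -> str:
    """Build the endpoint URL.

    Assumption: the path layout is /quality/<directory>/<owner>/<repo>,
    inferred from the one example URL shown on the page.
    """
    # Percent-encode each segment so unusual characters cannot break the path.
    path = "/".join(quote(part, safe="") for part in (directory, owner, repo))
    return f"{API_BASE}/{path}"


def fetch_quality(directory: str, owner: str, repo: str, timeout: float = 10.0):
    """Send a GET request and decode the body.

    Assumption: the response body is JSON; its schema is not documented here,
    so the parsed value is returned as-is.
    """
    request = urllib.request.Request(
        quality_url(directory, owner, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("transformers", "aahouzi", "llama2-chatbot-cpu")` would issue the same request as the curl command. With the keyless limit of 100 requests/day, it is worth caching the result locally rather than re-fetching on every run.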

Higher-rated alternatives

jakobdylanc/llmcord

Make Discord your LLM frontend - Supports any OpenAI compatible API (Ollama, xAI, Gemini,...

xNul/chat-llama-discord-bot

A Discord Bot for chatting with LLaMA, Vicuna, Alpaca, MPT, or any other Large Language Model...

amanvirparhar/weebo

A real-time speech-to-speech chatbot powered by Whisper Small, Llama 3.2, and Kokoro-82M.

OFA-Sys/ExpertLLaMA

An open-source ChatBot built with ExpertPrompting which achieves 96% of ChatGPT's capability.

ma2za/telegram-llm-bot

Telegram LLM bot backed by OpenAI, Whisper, Beam, LLaMA, Weaviate, MinIO and MongoDB
