daekeun-ml/genai-ko-LLM
This hands-on lab walks you step by step through efficiently serving and fine-tuning large-scale Korean language models on AWS infrastructure.
Covers QLoRA parameter-efficient fine-tuning on SageMaker training instances and multiple model-serving approaches for distributed inference, including DeepSpeed, TGI (Text Generation Inference), and NVIDIA FasterTransformer. Integrates with the HuggingFace Hub and SageMaker's DJL container, supports several Korean models (KULLM-Polyglot, KoAlpaca), and provides notebooks for local debugging before production deployment. Also includes RAG implementation examples alongside inference-optimization techniques.
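The QLoRA recipe the repo covers (a 4-bit quantized, frozen base model plus trainable low-rank adapters) can be sketched with Hugging Face `transformers` and `peft`. This is a minimal illustration, not the repo's actual notebook code: the model ID, LoRA rank/alpha, and `target_modules` below are assumptions chosen for a GPT-NeoX-style Korean model.

```python
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

# 4-bit NF4 quantization of the frozen base weights (the "Q" in QLoRA)
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_use_double_quant=True,
    bnb_4bit_compute_dtype=torch.bfloat16,
)

# Illustrative Korean base model; the repo targets KULLM-Polyglot / KoAlpaca
model = AutoModelForCausalLM.from_pretrained(
    "nlpai-lab/kullm-polyglot-5.8b-v2",  # assumption, not taken from the repo
    quantization_config=bnb_config,
    device_map="auto",
)
model = prepare_model_for_kbit_training(model)

# Low-rank adapters (the "LoRA" part); r/alpha are common defaults, not the repo's
lora_config = LoraConfig(
    r=8,
    lora_alpha=32,
    target_modules=["query_key_value"],  # GPT-NeoX-style fused attention projection
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # only the adapter weights remain trainable
```

The trained adapter weights are typically a few hundred megabytes and can be merged into the base model or loaded separately at serving time.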
No commits in the last 6 months.
Stars: 26
Forks: 8
Language: Jupyter Notebook
License: MIT
Category:
Last pushed: Feb 08, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/generative-ai/daekeun-ml/genai-ko-LLM"
The API is open to everyone at 100 requests/day with no key required; a free key raises the limit to 1,000 requests/day.
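The same endpoint can be called programmatically. A minimal Python sketch using only the standard library; the response schema is an assumption (the API's fields are not documented on this page, but presumably include the stats shown above):

```python
import json
import urllib.request

API_BASE = "https://pt-edge.onrender.com/api/v1/quality/generative-ai"

def repo_quality_url(owner: str, repo: str) -> str:
    """Build the quality-API URL for a GitHub owner/repo pair."""
    return f"{API_BASE}/{owner}/{repo}"

def fetch_repo_quality(owner: str, repo: str) -> dict:
    """Fetch the quality record as JSON (schema assumed, not documented here)."""
    with urllib.request.urlopen(repo_quality_url(owner, repo)) as resp:
        return json.load(resp)

# Build the URL for this repository (matches the curl example above)
print(repo_quality_url("daekeun-ml", "genai-ko-LLM"))
```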
Related tools
GURPREETKAURJETHRA/Llama-3-ORPO-Fine-Tuning
Llama 3 ORPO Fine Tuning on A100 in Colab Pro.
ramalamadingdong/onnx-rubikpi
ONNX LLM runtime on RUBIK-Pi with Gemma 1B and Llama 3.2 1B
keanteng/sesame-csm-elise
Fine-Tuning Sesame CSM With Elise. Enjoy the voice ( ̄︶ ̄)↗
sukanyabag/Finetuning-Qwen2-7B-VQA-on-Radiology-Scans
This repository fine-tunes the Qwen2 7B VLM to perform VQA (Visual Question...
DianaDorobantu/legal-llm
Develops a Romanian legal-domain Large Language Model (LLM) using a pre-trained model and...