LLM-TrainHub is a centralized collection of independent projects designed to facilitate training large language models (LLMs) with techniques such as SFT and DPO. It is built for flexibility, ease of modification, and a consistent style across projects. The core objective is to let users easily adapt the code to their own needs, particularly custom data loading, model architecture adjustments, and custom loss functions, while supporting common LLM training setups.
- Modular Projects: Each project is independent and follows a unified structure for easy navigation and customization.
- Pre-built Scripts: Example data, training, and inference scripts are provided for quick setup and usage.
- Native Transformers Integration: Wherever possible, the training framework uses Hugging Face's `transformers` library, specifically the `Trainer` class, for a seamless experience (see the sketch after this list).
- Multi-GPU Support: Distributed Data Parallel (DDP) and DeepSpeed are supported for multi-GPU training.
- Compatibility: Verified to work with the Qwen2 series of models.
- Customizable Workflows: Allow for easy modifications in data loading, model architecture, and loss functions.
- Efficient Training: Supports state-of-the-art techniques for LLM training, including LoRA fine-tuning and embedding generation.
- Flexible Model Handling: Provides utilities for custom model saving and loading workflows.
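As a concrete illustration of the native-`Trainer` workflow mentioned above, here is a minimal sketch; the model name, toy dataset, and hyperparameters are illustrative placeholders rather than the repository's actual scripts.

```python
from datasets import Dataset
from transformers import (
    AutoModelForCausalLM,
    AutoTokenizer,
    DataCollatorForLanguageModeling,
    Trainer,
    TrainingArguments,
)

model_name = "Qwen/Qwen2-1.5B"  # one of the Qwen2 models the repo targets
tokenizer = AutoTokenizer.from_pretrained(model_name)
if tokenizer.pad_token is None:          # ensure padding works for batched inputs
    tokenizer.pad_token = tokenizer.eos_token
model = AutoModelForCausalLM.from_pretrained(model_name)

# Tiny toy dataset; replace with your own tokenized training data.
train_dataset = Dataset.from_dict(
    {"text": ["Hello, world!", "LLM-TrainHub keeps each project independent."]}
).map(
    lambda ex: tokenizer(ex["text"], truncation=True, max_length=128),
    remove_columns=["text"],
)

args = TrainingArguments(
    output_dir="./outputs",
    per_device_train_batch_size=2,
    gradient_accumulation_steps=8,
    num_train_epochs=1,
    learning_rate=2e-5,
    logging_steps=10,
)

trainer = Trainer(
    model=model,
    args=args,
    train_dataset=train_dataset,
    # Causal-LM collator: pads the batch and copies input_ids into labels.
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
```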
- ✅ llm-lora-simple: A simplified single-machine LoRA fine-tuning setup for large language models (see the LoRA sketch after this roadmap).
  - 🔄 Migration to DDP, with further adjustments planned.
- ✅ llm-embedding: A project for generating embeddings with LLMs (see the adapter sketch after this roadmap).
  - ✅ Fine-tune the LLM's parameters to generate embeddings.
  - ✅ Freeze the LLM's parameters and add a fully connected adapter layer to generate embeddings.
- 🔄 LLM Full-Parameter SFT: Full-parameter supervised fine-tuning of large language models.
- 🔄 LLM-LoRA Fine-Tuning: Fine-tuning large language models with LoRA, with plans for scalability improvements.
- ⏳ LLM-DPO Training: Training LLMs with direct preference optimization (DPO).
- ⏳ NoteLLM Replication: Reproducing the NoteLLM model and its associated tasks.
- ⏳ VLM Fine-Tuning: Fine-tuning vision-language models for specific tasks.
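For the LoRA projects above, a typical single-machine setup attaches adapters with the `peft` library. The sketch below assumes a Qwen2-style model whose attention projections are named `q_proj`/`k_proj`/`v_proj`/`o_proj`; it is an illustration, not the repository's exact implementation.

```python
from peft import LoraConfig, TaskType, get_peft_model
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen2-1.5B")

lora_config = LoraConfig(
    task_type=TaskType.CAUSAL_LM,
    r=8,               # rank of the low-rank update matrices
    lora_alpha=16,     # scaling applied to the LoRA update
    lora_dropout=0.05,
    # Attention projection names as they appear in Qwen2-style models.
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
)

model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # only the adapter weights remain trainable
```

The wrapped model drops directly into the `Trainer` setup sketched earlier, and `model.save_pretrained(...)` then stores only the adapter weights.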
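For llm-embedding's frozen-backbone mode, the idea is to freeze every LLM parameter and train only a small fully connected adapter on top of pooled hidden states. The sketch below uses mean pooling and an assumed output dimension of 768; both are illustrative choices rather than the project's exact design.

```python
import torch
import torch.nn as nn
from transformers import AutoModel, AutoTokenizer


class FrozenLLMEmbedder(nn.Module):
    def __init__(self, model_name: str = "Qwen/Qwen2-1.5B", embed_dim: int = 768):
        super().__init__()
        self.backbone = AutoModel.from_pretrained(model_name)
        for p in self.backbone.parameters():   # freeze every backbone parameter
            p.requires_grad = False
        # Only this fully connected adapter is trained.
        self.adapter = nn.Linear(self.backbone.config.hidden_size, embed_dim)

    def forward(self, input_ids, attention_mask):
        hidden = self.backbone(
            input_ids=input_ids, attention_mask=attention_mask
        ).last_hidden_state
        mask = attention_mask.unsqueeze(-1).float()
        # Mean-pool over real (non-padding) tokens, then project.
        pooled = (hidden * mask).sum(dim=1) / mask.sum(dim=1)
        return self.adapter(pooled)


tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen2-1.5B")
if tokenizer.pad_token is None:
    tokenizer.pad_token = tokenizer.eos_token

embedder = FrozenLLMEmbedder()
batch = tokenizer(["an example note", "another example"], return_tensors="pt", padding=True)
with torch.no_grad():
    embeddings = embedder(batch["input_ids"], batch["attention_mask"])  # shape: (2, 768)
```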
Goals:
- Keep each project relatively independent and easy to modify, with example data, training, and prediction scripts.
- Use the native transformers Trainer wherever possible.
- Support DDP and DeepSpeed (see the launch sketch below).
- Run end-to-end on Qwen2-1.5B and Qwen2-7B.
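As a sketch of the multi-GPU goal: DDP runs are typically launched with `torchrun`, and DeepSpeed is enabled by passing a ZeRO config path to `TrainingArguments`. The script and config file names below are placeholders, not files shipped with this repository.

```python
# Launch commands (placeholders for your own entry script):
#   torchrun --nproc_per_node=4 train.py     # plain DDP
#   deepspeed --num_gpus=4 train.py          # DeepSpeed
from transformers import TrainingArguments

args = TrainingArguments(
    output_dir="./outputs",
    per_device_train_batch_size=2,
    gradient_accumulation_steps=8,
    # Passing a DeepSpeed ZeRO config file enables DeepSpeed inside the Trainer.
    # "ds_config_zero2.json" is a placeholder name, not a file in this repository.
    deepspeed="ds_config_zero2.json",
)
```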