8 Commits

zR · 1534bf33eb · 2025-01-12 19:27:21 +08:00
add pipeline

OleehyO · fdb9820949 · 2025-01-12 05:33:56 +00:00
feat: support DeepSpeed ZeRO-3 and optimize peak memory usage
- Add DeepSpeed ZeRO-3 configuration support
- Optimize memory usage during training
- Rename training scripts to reflect ZeRO usage
- Update related configuration files and trainers
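
To make the first bullet concrete, here is a minimal sketch of enabling ZeRO-3 through accelerate's DeepSpeedPlugin; the argument values are illustrative defaults, not the configuration files this commit actually ships.

```python
# Minimal sketch: DeepSpeed ZeRO-3 via Hugging Face accelerate.
# Values are illustrative, not the repo's actual settings.
from accelerate import Accelerator, DeepSpeedPlugin

deepspeed_plugin = DeepSpeedPlugin(
    zero_stage=3,                     # shard params, grads, and optimizer states
    gradient_accumulation_steps=1,
    gradient_clipping=1.0,
    offload_optimizer_device="none",  # "cpu" for the offload variants
    offload_param_device="none",
)

accelerator = Accelerator(
    mixed_precision="bf16",
    deepspeed_plugin=deepspeed_plugin,
)
# Training objects are then wrapped as usual:
# model, optimizer, loader = accelerator.prepare(model, optimizer, loader)
```

Unlike ZeRO-2, ZeRO-3 shards the parameters themselves in addition to gradients and optimizer states, which is what lowers the peak memory this commit targets.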

OleehyO · caa24bdc36 · 2025-01-11 02:13:32 +00:00
feat: add SFT support with ZeRO optimization strategies
- Add SFT (Supervised Fine-Tuning) trainers for all model variants:
  - CogVideoX I2V and T2V
  - CogVideoX-1.5 I2V and T2V
- Add DeepSpeed ZeRO configuration files:
  - ZeRO-2 with and without CPU offload
  - ZeRO-3 with and without CPU offload
- Add base accelerate config for distributed training
- Update trainer.py to support SFT training mode

This enables full-parameter fine-tuning with memory-efficient distributed training using DeepSpeed ZeRO optimization.
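
A sketch of the SFT-versus-LoRA switch that such a trainer.py change implies, assuming a diffusers transformer with PEFT adapter support; configure_trainable_params and the LoRA hyperparameters are hypothetical, not the commit's actual code.

```python
# Hypothetical helper contrasting full-parameter SFT with LoRA training.
from peft import LoraConfig

def configure_trainable_params(transformer, training_type: str) -> None:
    if training_type == "sft":
        # Full-parameter fine-tuning: every weight trains, hence the need
        # for ZeRO-2/ZeRO-3 sharding (optionally with CPU offload).
        transformer.requires_grad_(True)
    elif training_type == "lora":
        # LoRA: freeze the base model and train low-rank adapters only.
        transformer.requires_grad_(False)
        transformer.add_adapter(
            LoraConfig(r=64, lora_alpha=64,
                       target_modules=["to_q", "to_k", "to_v", "to_out.0"])
        )
    else:
        raise ValueError(f"Unknown training type: {training_type}")
```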

OleehyO · 36427274d6 · 2025-01-07 05:54:52 +00:00
style: format import statements across finetune module

zR · 1789f07256 · 2025-01-07 13:16:18 +08:00
format and check fp16 for cogvideox2b
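
CogVideoX-2B was released with fp16 weights while the larger variants use bf16, which is why a check like this matters. A sketch with a hypothetical helper name; the commit's actual check may differ.

```python
# Hypothetical dtype sanity check for CogVideoX model variants.
import torch

def resolve_dtype(model_name: str, mixed_precision: str) -> torch.dtype:
    # CogVideoX-2B was trained in fp16; the 5B / 1.5 models use bf16.
    if "cogvideox-2b" in model_name.lower() and mixed_precision == "bf16":
        raise ValueError("CogVideoX-2B expects fp16; pass mixed_precision='fp16'")
    return torch.float16 if mixed_precision == "fp16" else torch.bfloat16
```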

OleehyO · 66e4ba2592 · 2025-01-04 06:16:42 +00:00
fix(cogvideox): add prompt embedding caching and fix frame padding
- Add support for cached prompt embeddings in dataset
- Fix bug where the first frame wasn't properly padded in latent space
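
One common way to implement such a cache, sketched with hypothetical names (encode_fn and the hashing scheme are assumptions, not the dataset's actual code): compute each prompt's embedding once, key it by a hash of the prompt, and reload it on later epochs.

```python
# Sketch of prompt-embedding caching; names are hypothetical.
import hashlib
from pathlib import Path

import torch

def get_prompt_embedding(encode_fn, prompt: str, cache_dir: Path) -> torch.Tensor:
    """Return a cached text embedding, computing and saving it on first use."""
    cache_dir.mkdir(parents=True, exist_ok=True)
    key = hashlib.sha256(prompt.encode("utf-8")).hexdigest()
    cache_file = cache_dir / f"{key}.pt"
    if cache_file.exists():
        return torch.load(cache_file)  # cache hit: skip the text encoder
    embedding = encode_fn(prompt)      # cache miss: run the text encoder once
    torch.save(embedding, cache_file)
    return embedding
```

Since the same captions recur every epoch, this removes the text encoder from the steady-state training loop entirely.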

OleehyO · a001842834 · 2025-01-01 15:10:54 +00:00
feat: implement CogVideoX trainers for I2V and T2V tasks
Add and refactor trainers for CogVideoX model variants:
- Implement CogVideoXT2VLoraTrainer for text-to-video generation
- Refactor CogVideoXI2VLoraTrainer for image-to-video generation

Both trainers support LoRA fine-tuning with proper handling of:
- Model component loading and initialization
- Video encoding and batch collation
- Loss computation with noise prediction
- Validation step for generation
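
The noise-prediction loss these trainers compute follows the standard denoising-diffusion shape. The sketch below is a simplified epsilon-prediction version; the actual trainers use CogVideoX's own scheduler and prediction type, and the I2V variant additionally conditions on the first-frame image latent.

```python
# Simplified training-step sketch shared by the T2V and I2V trainers.
import torch
import torch.nn.functional as F

def compute_loss(transformer, scheduler, latents, prompt_embeds):
    noise = torch.randn_like(latents)
    timesteps = torch.randint(
        0, scheduler.config.num_train_timesteps,
        (latents.shape[0],), device=latents.device,
    )
    noisy_latents = scheduler.add_noise(latents, noise, timesteps)

    # Predict the noise added to the video latents, given the prompt.
    pred = transformer(
        hidden_states=noisy_latents,
        encoder_hidden_states=prompt_embeds,
        timestep=timesteps,
    ).sample
    return F.mse_loss(pred.float(), noise.float())
```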

OleehyO · 85e00a1082 · 2025-01-01 15:10:40 +00:00
feat(models): add scaffolding