CogVideo

mirror of https://github.com/THUDM/CogVideo.git synced 2025-04-05 03:04:56 +08:00

Author	SHA1	Message	Date
Yuxuan Zhang	39c6562dc8	format	2025-03-22 15:14:06 +08:00
OleehyO	36427274d6	style: format import statements across finetune module	2025-01-07 05:54:52 +00:00
zR	1789f07256	format and check fp16 for cogvideox2b	2025-01-07 13:16:18 +08:00
OleehyO	e5b8f9a2ee	feat: add caching for prompt embeddings - Add caching for prompt embeddings - Store cached files using safetensors format - Add cache directory structure under data_root/cache - Optimize memory usage by moving tensors to CPU after caching - Add debug logging for cache hits - Add info logging for cache writes. The caching system helps reduce redundant computation and memory usage during training by: (1) caching prompt embeddings based on prompt text hash; (2) caching encoded video latents based on video filename; (3) moving tensors to CPU after caching to free GPU memory.	2025-01-04 06:16:31 +00:00
OleehyO	6eae5c201e	feat: add latent caching for video encodings - Add caching mechanism to store VAE-encoded video latents to disk - Cache latents in a "latent" subdirectory alongside video files - Skip re-encoding when cached latent file exists - Add logging for successful cache saves - Minor code cleanup and formatting improvements. This change improves training efficiency by avoiding redundant video encoding operations.	2025-01-01 15:10:42 +00:00
OleehyO	918ebb5a54	feat(datasets): implement video dataset modules - Add dataset implementations for text-to-video and image-to-video - Include bucket sampler for efficient batch processing - Add utility functions for data processing - Create dataset package structure with proper initialization	2025-01-01 15:10:40 +00:00

6 Commits
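
The caching behaviour described in commits e5b8f9a2ee and 6eae5c201e can be illustrated with a minimal sketch. This is not the repository's code: the function names, the `prompt_embeddings` subdirectory, the choice of SHA-256, and the `.json` file extension are all assumptions made for illustration. The commits state that cached files use the safetensors format; JSON is swapped in here only so the sketch runs with the standard library alone.

```python
import hashlib
import json
from pathlib import Path
from typing import Callable, List


def prompt_cache_path(data_root: Path, prompt: str) -> Path:
    """Cache file for a prompt, keyed on a hash of the prompt text.

    The commit only says the cache lives under data_root/cache; the
    "prompt_embeddings" subdirectory and SHA-256 are assumptions.
    """
    digest = hashlib.sha256(prompt.encode("utf-8")).hexdigest()
    return data_root / "cache" / "prompt_embeddings" / f"{digest}.json"


def latent_cache_path(video_path: Path) -> Path:
    """Cache file for a video, keyed on its filename.

    Placed in a "latent" subdirectory alongside the video, as the commit
    describes; the ".json" extension is an assumption.
    """
    return video_path.parent / "latent" / f"{video_path.stem}.json"


def load_or_compute(path: Path, compute: Callable[[], List[float]]) -> List[float]:
    """Return the cached value if the file exists, else compute and store it."""
    if path.exists():
        # Cache hit: skip the expensive encoding step entirely.
        return json.loads(path.read_text())
    value = compute()
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text(json.dumps(value))
    return value


def cached_prompt_embedding(
    data_root: Path, prompt: str, encode_fn: Callable[[str], List[float]]
) -> List[float]:
    """Encode a prompt at most once per distinct prompt text."""
    return load_or_compute(prompt_cache_path(data_root, prompt), lambda: encode_fn(prompt))


def cached_video_latent(
    video_path: Path, encode_fn: Callable[[Path], List[float]]
) -> List[float]:
    """Encode a video at most once per video file name."""
    return load_or_compute(latent_cache_path(video_path), lambda: encode_fn(video_path))
```

In a real training loop the `encode_fn` arguments would wrap the text encoder and the VAE, and the stored values would be tensors moved to CPU before saving, as the commit message notes. The sketch keeps plain lists of floats so that the lookup logic stays visible.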