多模型平台上线

This commit is contained in:
zR 2024-08-06 11:33:11 +08:00
parent f9ec8a9569
commit 4699b5a6e6
5 changed files with 31 additions and 28 deletions

View File

@ -57,18 +57,19 @@ to [清影](https://chatglm.cn/video).
The table below shows the list of video generation models we currently provide, The table below shows the list of video generation models we currently provide,
along with related basic information: along with related basic information:
| Model Name | CogVideoX-2B | | Model Name | CogVideoX-2B |
|-------------------------------------------|---------------------------------------------------------------------------------------------------------------------------------------| |-------------------------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Prompt Language | English | | Prompt Language | English |
| GPU Memory Required for Inference (FP16) | 18GB if using [SAT](https://github.com/THUDM/SwissArmyTransformer); 36GB if using diffusers (will be optimized before the PR is merged) | | GPU Memory Required for Inference (FP16) | 18GB if using [SAT](https://github.com/THUDM/SwissArmyTransformer); 36GB if using diffusers (will be optimized before the PR is merged) |
| GPU Memory Required for Fine-tuning(bs=1) | 40GB | | GPU Memory Required for Fine-tuning(bs=1) | 40GB |
| Prompt Max Length | 226 Tokens | | Prompt Max Length | 226 Tokens |
| Video Length | 6 seconds | | Video Length | 6 seconds |
| Frames Per Second | 8 frames | | Frames Per Second | 8 frames |
| Resolution | 720 * 480 | | Resolution | 720 * 480 |
| Quantized Inference | Not Supported | | Quantized Inference | Not Supported |
| Multi-card Inference | Not Supported | | Multi-card Inference | Not Supported |
| Download Link | 🤗 [CogVideoX-2B](https://huggingface.co/THUDM/CogVideoX-2B) | | Download Link (HF diffusers Model) | 🤗 [Huggingface](https://huggingface.co/THUDM/CogVideoX-2B) [🤖 ModelScope](https://modelscope.cn/models/ZhipuAI/CogVideoX-2b) [💫 WiseModel](https://wisemodel.cn/models/ZhipuAI/CogVideoX-2b) |
| Download Link (SAT Model) | [SAT](./sat/README.md) |
## Project Structure ## Project Structure
@ -120,7 +121,7 @@ We welcome your contributions. You can click [here](resources/contribute.md) for
The code in this repository is released under the [Apache 2.0 License](LICENSE). The code in this repository is released under the [Apache 2.0 License](LICENSE).
The model weights and implementation code are released under the [CogVideoX LICENSE](Model_License). The model weights and implementation code are released under the [CogVideoX LICENSE](MODEL_LICENSE).
## Citation ## Citation

View File

@ -54,18 +54,19 @@ CogVideoX是 [清影](https://chatglm.cn/video) 同源的开源版本视频生
下表战展示目前我们提供的视频生成模型列表,以及相关基础信息: 下表战展示目前我们提供的视频生成模型列表,以及相关基础信息:
| 模型名字 | CogVideoX-2B | | 模型名字 | CogVideoX-2B |
|----------------|--------------------------------------------------------------------------------------------------------------------------------------| |---------------------|--------------------------------------------------------------------------------------------------------------------------------------|
| 提示词语言 | English | | 提示词语言 | English |
| 推理显存消耗 (FP-16) | 36GB using diffusers (will be optimized before the PR is merged) and 18GB using [SAT](https://github.com/THUDM/SwissArmyTransformer) | | 推理显存消耗 (FP-16) | 36GB using diffusers (will be optimized before the PR is merged) and 18GB using [SAT](https://github.com/THUDM/SwissArmyTransformer) |
| 微调显存消耗 (bs=1) | 42GB | | 微调显存消耗 (bs=1) | 42GB |
| 提示词长度上限 | 226 Tokens | | 提示词长度上限 | 226 Tokens |
| 视频长度 | 6 seconds | | 视频长度 | 6 seconds |
| 帧率(每秒) | 8 frames | | 帧率(每秒) | 8 frames |
| 视频分辨率 | 720 * 480 | | 视频分辨率 | 720 * 480 |
| 量化推理 | 不支持 | | 量化推理 | 不支持 |
| 多卡推理 | 不支持 | | 多卡推理 | 不支持 |
| 权重地址 | 🤗 [CogVideoX-2B](https://huggingface.co/THUDM/CogVideoX-2B) | | 下载地址 (Diffusers 模型) | 🤗 [Huggingface](https://huggingface.co/THUDM/CogVideoX-2B) [🤖 ModelScope](https://modelscope.cn/models/ZhipuAI/CogVideoX-2b) |
| 下载地址 (SAT 模型) | [SAT](./sat/README_zh.md) |
## 项目结构 ## 项目结构

View File

@ -6,3 +6,4 @@ opencv-python>=4.10
imageio-ffmpeg>=0.5.1 imageio-ffmpeg>=0.5.1
openai>=1.38.0 openai>=1.38.0
transformers>=4.43.3 transformers>=4.43.3
accelerate>=0.33.0

View File

@ -22,7 +22,7 @@ mkdir CogVideoX-2b-sat
cd CogVideoX-2b-sat cd CogVideoX-2b-sat
wget https://cloud.tsinghua.edu.cn/f/fdba7608a49c463ba754/?dl=1 wget https://cloud.tsinghua.edu.cn/f/fdba7608a49c463ba754/?dl=1
mv 'index.html?dl=1' vae.zip mv 'index.html?dl=1' vae.zip
uzip vae.zip unzip vae.zip
wget https://cloud.tsinghua.edu.cn/f/556a3e1329e74f1bac45/?dl=1 wget https://cloud.tsinghua.edu.cn/f/556a3e1329e74f1bac45/?dl=1
mv 'index.html?dl=1' transformer.zip mv 'index.html?dl=1' transformer.zip
unzip transformer.zip unzip transformer.zip

View File

@ -21,7 +21,7 @@ mkdir CogVideoX-2b-sat
cd CogVideoX-2b-sat cd CogVideoX-2b-sat
wget https://cloud.tsinghua.edu.cn/f/fdba7608a49c463ba754/?dl=1 wget https://cloud.tsinghua.edu.cn/f/fdba7608a49c463ba754/?dl=1
mv 'index.html?dl=1' vae.zip mv 'index.html?dl=1' vae.zip
uzip vae.zip unzip vae.zip
wget https://cloud.tsinghua.edu.cn/f/556a3e1329e74f1bac45/?dl=1 wget https://cloud.tsinghua.edu.cn/f/556a3e1329e74f1bac45/?dl=1
mv 'index.html?dl=1' transformer.zip mv 'index.html?dl=1' transformer.zip
unzip transformer.zip unzip transformer.zip