From 2da594c831685003214a230df8a53c3f84b4fb0e Mon Sep 17 00:00:00 2001
From: zR <2448370773@qq.com>
Date: Tue, 6 Aug 2024 02:16:43 +0800
Subject: [PATCH] GPU memory cost changed

---
 README.md    | 8 +++++---
 README_zh.md | 2 +-
 2 files changed, 6 insertions(+), 4 deletions(-)

diff --git a/README.md b/README.md
index 97415bb..3db483e 100644
--- a/README.md
+++ b/README.md
@@ -18,7 +18,8 @@
 
 ## Update and News
 
-- 🔥 **News**: ``2024/8/6``: We have also open-sourced **3D Causal VAE** used in **CogVideoX-2B**, which can reconstruct the video almost losslessly.
+- 🔥 **News**: ``2024/8/6``: We have also open-sourced **3D Causal VAE** used in **CogVideoX-2B**, which can reconstruct
+  the video almost losslessly.
 - 🔥 **News**: ``2024/8/6``: We have open-sourced **CogVideoX-2B**，the first model in the CogVideoX series of video
   generation models.
 
@@ -57,7 +58,7 @@ along with related basic information:
 | Model Name                                | CogVideoX-2B                                                 | 
 |-------------------------------------------|--------------------------------------------------------------|
 | Prompt Language                           | English                                                      | 
-| GPU Memory Required for Inference (FP16)  | 21.6GB                                                       | 
+| GPU Memory Required for Inference (FP16)  | 36GB (will be optimized before the PR is merged)             | 
 | GPU Memory Required for Fine-tuning(bs=1) | 46.2GB                                                       |
 | Prompt Max  Length                        | 226 Tokens                                                   |
 | Video Length                              | 6 seconds                                                    | 
@@ -76,7 +77,8 @@ of the **CogVideoX** open-source model.
 
 + [cli_demo](inference/cli_demo.py): A more detailed explanation of the inference code, mentioning the significance of
   common parameters.
-+ [cli_vae_demo](inference/cli_vae_demo.py): Executing the VAE inference code alone currently requires 71GB of memory, but it will be optimized in the future.
++ [cli_vae_demo](inference/cli_vae_demo.py): Executing the VAE inference code alone currently requires 71GB of memory,
+  but it will be optimized in the future.
 + [convert_demo](inference/converter_demo.py): How to convert user input into a format suitable for CogVideoX.
 + [web_demo](inference/web_demo.py): A simple streamlit web application demonstrating how to use the CogVideoX-2B model
   to generate videos.
diff --git a/README_zh.md b/README_zh.md
index f864e06..b831df2 100644
--- a/README_zh.md
+++ b/README_zh.md
@@ -55,7 +55,7 @@ CogVideoX是 [清影](https://chatglm.cn/video) 同源的开源版本视频生
 | 模型名字           | CogVideoX-2B                                                 | 
 |----------------|--------------------------------------------------------------|
 | 提示词语言          | English                                                      | 
-| 推理显存消耗 (FP-16) | 21.6GB                                                       | 
+| 推理显存消耗 (FP-16) | 36GB                                                         | 
 | 微调显存消耗 (bs=1)  | 46.2GB                                                       |
 | 提示词长度上限        | 226 Tokens                                                   |
 | 视频长度           | 6 seconds                                                    |