diff --git a/README.md b/README.md
index 9693079..8e04557 100644
--- a/README.md
+++ b/README.md
@@ -14,7 +14,7 @@ Experience the CogVideoX-5B model online at paper and user guide

- 👋 Join our WeChat and Discord
+ 👋 Join our WeChat and Discord

  📍 Visit QingYing and API Platform to experience larger-scale commercial video generation models.
diff --git a/README_ja.md b/README_ja.md
index 7ea9851..1bf5830 100644
--- a/README_ja.md
+++ b/README_ja.md
@@ -14,7 +14,7 @@ 📚 論文使用ドキュメントを表示します。

- 👋 WeChatDiscord に参加
+ 👋 WeChatDiscord に参加

  📍 清影APIプラットフォーム を訪問して、より大規模な商用ビデオ生成モデルを体験
diff --git a/README_zh.md b/README_zh.md
index 89eab9c..38821a5 100644
--- a/README_zh.md
+++ b/README_zh.md
@@ -15,7 +15,7 @@ 📚 查看 论文使用文档

- 👋 加入我们的 微信Discord
+ 👋 加入我们的 微信Discord

  📍 前往 清影 API平台 体验更大规模的商业版视频生成模型。
diff --git a/sat/dit_video_concat.py b/sat/dit_video_concat.py
index 12e0be3..e77e7d0 100644
--- a/sat/dit_video_concat.py
+++ b/sat/dit_video_concat.py
@@ -773,12 +773,15 @@ class DiffusionTransformer(BaseModel):
         b, t, d, h, w = x.shape
         if x.dtype != self.dtype:
             x = x.to(self.dtype)
+
+        # This is not used in inference
         if "concat_images" in kwargs and kwargs["concat_images"] is not None:
             if kwargs["concat_images"].shape[0] != x.shape[0]:
                 concat_images = kwargs["concat_images"].repeat(2, 1, 1, 1, 1)
             else:
                 concat_images = kwargs["concat_images"]
             x = torch.cat([x, concat_images], dim=2)
+
         assert (y is not None) == (
             self.num_classes is not None
         ), "must specify y if and only if the model is class-conditional"