Nodes Browser
ComfyDeploy: How ComfyUI-TeaCache works in ComfyUI?
What is ComfyUI-TeaCache?
Unofficial implementation of [ali-vilab/TeaCache](https://github.com/ali-vilab/TeaCache) for ComfyUI
How to install it in ComfyDeploy?
Head over to the machine page
- Click on the "Create a new machine" button
- Select the
Edit
build steps - Add a new step -> Custom Node
- Search for
ComfyUI-TeaCache
and select it - Close the build step dialig and then click on the "Save" button to rebuild the machine
ComfyUI-TeaCache
Introduction
Timestep Embedding Aware Cache (TeaCache) is a training-free caching approach that estimates and leverages the fluctuating differences among model outputs across timesteps, thereby accelerating the inference. TeaCache works well for Image Diffusion models, Video Diffusion Models, and Audio Diffusion Models.
TeaCache has now been integrated into ComfyUI and is compatible with the ComfyUI native nodes. ComfyUI-TeaCache is easy to use, simply connect the TeaCache node with the ComfyUI native nodes for seamless usage.
Updates
- Mar 26 2025: ComfyUI-TeaCache supports retention mode for Wan2.1 models and HunyuanVideo I2V v2 model:
- Retention mode for Wan2.1 models can bring faster generation and better generation quality.
- Fixes a bug about HunyuanVideo I2V v2 model.
- Mar 10 2025: ComfyUI-TeaCache adds max_skip_steps option and has made some changes for ease of use:
- Add max_skip_steps option to enjoy a good trade-off between quality and speed for Wan2.1 models. The best settings are shown in the usage section.
- Merge TeaCache For Img Gen and TeaCache For Vid Gen nodes into a single TeaCache node.
- Update TeaCache for HunyuanVideo and LTX-Video to match the offical ComfyUI update.
- Mar 6 2025: ComfyUI-TeaCache supports Wan2.1:
- It can achieve a 1.5x lossless speedup and a 2x speedup without much visual quality degradation.
- Support Text to Video and Image to Video!
- Jan 17 2025: ComfyUI-TeaCache supports CogVideoX:
- It can achieve a 1.5x lossless speedup and a 2x speedup without much visual quality degradation.
- Support Text to Video and Image to Video!
- Note that TeaCache for CogVideoX node needs to be used with kijai's ComfyUI-CogVideoXWrapper nodes.
- Jan 15 2025: Thanks @TangYanxin, ComfyUI-TeaCache supports PuLID-FLUX and fixes bug about rel_l1_thresh:
- It can achieve a 1.2x lossless speedup and a 1.7x speedup without much visual quality degradation.
- Fixes a bug about rel_l1_thresh, when there are multiple TeaCache nodes in a workflow, the rel_l1_thresh value is always the value of the last TeaCache node.
- Jan 14 2025: ComfyUI-TeaCache supports Compile Model and fixes a bug that TeaCache keeps forever even if we remove/bypass the node:
- Support Compile Model, now it can bring a faster inference when you add Compile Model node!
- Fixes a bug related to usability, now we can go back to the workflow state without TeaCache if we remove/bypass TeaCache node.
- Jan 13 2025: Thanks @TangYanxin, ComfyUI-TeaCache remove the Steps setting from the node:
- Now, it works fine even if there are multiple sampling nodes with different sampling steps in the workflow.
- Fixes a bug, RuntimeError: The size of tensor a must match the size of tensor b at non-singleton dimension.
- Jan 10 2025: ComfyUI-TeaCache supports LTX-Video:
- It can achieve a 1.4x lossless speedup and a 1.7x speedup without much visual quality degradation.
- Support Text to Video and Image to Video!
- Jan 9 2025: ComfyUI-TeaCache supports HunyuanVideo:
- It can achieve a 1.6x lossless speedup and a 2x speedup without much visual quality degradation.
- Jan 8 2025: ComfyUI-TeaCache supports FLUX:
- It can achieve a 1.4x lossless speedup and a 2x speedup without much visual quality degradation.
- Support FLUX LoRA!
- Support FLUX ControlNet!
Installation
Installation via ComfyUI-Manager is preferred. Simply search for ComfyUI-TeaCache in the list of nodes and click install.
Manual installation
- Go to comfyUI custom_nodes folder,
ComfyUI/custom_nodes/
- git clone https://github.com/welltop-cn/ComfyUI-TeaCache.git
- pip install -r requirements.txt
Usage
TeaCache
To use TeaCache node, simply add TeaCache
node to your workflow after Load Diffusion Model
node or Load LoRA
node (if you need LoRA). Generally, TeaCache can achieve a speedup of 1.5x to 3x with acceptable visual quality loss. The following table gives the recommended rel_l1_thresh and max_skip_steps for different models:
| Models | rel_l1_thresh | max_skip_steps | speedup | |:----------------------------:|:-----------------:|:-----------------:|:-----------------:| | FLUX | 0.4 | 3 | ~2x | | PuLID-FLUX | 0.4 | 3 | ~1.7x | | HunyuanVideo | 0.15 | 3 | ~1.9x | | LTX-Video | 0.06 | 3 | ~1.7x | | CogVideoX | 0.3 | 3 | ~2x | | Wan2.1-T2V-1.3B | 0.08 | 3 | ~1.6x | | Wan2.1-T2V-14B | 0.2 | 3 | ~1.8x | | Wan2.1-I2V-480P-14B | 0.26 | 3 | ~1.9x | | Wan2.1-I2V-720P-14B | 0.25 | 3 | ~1.6x | | Wan2.1-T2V-1.3B-ret-mode | 0.15 | 3 | ~2.2x | | Wan2.1-T2V-14B-ret-mode | 0.2 | 3 | ~2.1x | | Wan2.1-I2V-480P-14B-ret-mode | 0.3 | 3 | ~2.3x | | Wan2.1-I2V-720P-14B-ret-mode | 0.3 | 3 | ~2.0x |
</div>If the video after applying TeaCache is of low quality, such as a lower range of motion or the still frames, please reduce rel_l1_thresh or max_skip_steps.
The demo workflows (flux, pulid_flux, hunyuanvideo, ltx_video, cogvideox, wan2.1_t2v and wan2.1_i2v) are placed in examples folder.
Compile Model
To use Compile Model node, simply add Compile Model
node to your workflow after Load Diffusion Model
node or TeaCache
node. Compile Model uses torch.compile
to enhance the model performance by compiling model into more efficient intermediate representations (IRs). This compilation process leverages backend compilers to generate optimized code, which can significantly speed up inference. The compilation may take long time when you run the workflow at first, but once it is compiled, inference is extremely fast. The usage is shown below:
Result comparison
- <p><strong>FLUX</strong></p>
- <p><strong>PuLID-FLUX</strong></p>
- <p><strong>HunyuanVideo</strong></p>
https://github.com/user-attachments/assets/b3aca64d-c2ae-440c-a362-f3a7b6c633e0
- <p><strong>LTX-Video</strong></p>
https://github.com/user-attachments/assets/8fce9b48-2243-46f1-b411-80e4a53f6f7d
- <p><strong>CogVideoX</strong></p>
https://github.com/user-attachments/assets/de8f5400-e885-446d-936f-c026a78ba5c2
- <p><strong>Wan2.1-T2V</strong></p>
https://github.com/user-attachments/assets/8a84a862-1947-4ba8-8298-047d50d8f670
- <p><strong>Wan2.1-I2V</strong></p>
https://github.com/user-attachments/assets/0ed5c2c6-b749-41cb-9e2b-e152ecd6c7c1
Acknowledgments
Thanks to TeaCache repo owner ali-vilab/TeaCache: Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model