To use stepfun's library, you need an official api that supports multimodal inputs such as video and pictures a/https://platform.stepfun.com/request-restriction
This is a collection of ComfyUI custom nodes based on the StepFun API. These nodes enable easy implementation of intelligent analysis and processing for images and videos. Currently, video upload is not implemented, and video URL links are required.
🚧 More nodes are being updated...
custom_nodes
directory:cd custom_nodes
git clone https://github.com/your-repo/ComfyUI_StepFun.git
cd ComfyUI_StepFun
pip install -r requirements.txt
The following examples are provided in the workflow
folder:
workflow/role2story.json
🚧 More workflow examples are being updated...
You can now use the StepFun Video File Uploader
node for direct video uploads. Simply provide the absolute path to your video file (e.g., C:\Users\Administrator\Downloads\test.mp4
). The node will return a stepfile://file-id
string that can be used as input for the video_url
parameter in the StepFun Video Chat
node.
For videos larger than 128MB or in different formats, you can use FFmpeg for preprocessing:
To split a large video into smaller segments (120-second chunks):
ffmpeg -i sample.mp4 -acodec copy -f segment -segment_time 120 -vcodec copy -reset_timestamps 1 -map 0 output_time_%d.mp4
To convert videos from other formats to MP4:
ffmpeg -i sample.mkv -codec copy sample.mp4
Note: It's recommended to compress videos before uploading for better performance.
If you encounter any issues while using this project, please feel free to contact us through the above methods.