Transcribe audio and video files in ComfyUI.
This package provides custom nodes for ComfyUI to perform transcription tasks on audio and video file inputs. Well suited for long duration inputs. Includes multi-language support and batch processing of many files at once.
<img src="assets/workflow_example.jpg" alt="isolated" width="800"/>Note: To text (Debug) is part of Derfuu_ModdedNodes.
Supported Transcription Models
The team supporting ComfyUI-VideoHelperSuite which provided insrumental examples in the development of this tool.