Daifuku is a versatile framework designed to serve multiple Text-to-Video (T2V) models (e.g., Mochi, LTX, and more). It streamlines T2V model deployment by providing:
A unified API for multiple models
Parallel batch processing
GPU optimizations for efficiency
Easy Docker-based deployment
Integrated monitoring, logging, and metrics
Inspired by the concept of daifuku mochi—a sweet stuffed treat—this framework “stuffed” with multiple T2V capabilities aims to make your video generation as sweet and satisfying as possible.
Built with