AI Video Avatar

A multimodal AI avatar system that converts text and speech input into synchronized talking-head video using GeneFace++ together with Google Speech-to-Text (STT) and Text-to-Speech (TTS), with a processing time of under two minutes.
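
The flow described above could be sketched roughly as follows. This is only an illustration under stated assumptions, not the project's actual code: the `AvatarPipeline` class and its `transcribe`, `synthesize`, and `render` callables are hypothetical names, and the ordering (speech is first transcribed, then re-synthesized before rendering) is an assumption about how the STT, TTS, and GeneFace++ stages fit together. The stages are injected as plain callables so that no real client API is implied.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class AvatarPipeline:
    """Hypothetical three-stage pipeline; every stage is an injected callable."""

    transcribe: Callable[[bytes], str]   # speech audio -> text (assumed STT wrapper)
    synthesize: Callable[[str], bytes]   # text -> speech audio (assumed TTS wrapper)
    render: Callable[[bytes], bytes]     # speech audio -> video (assumed GeneFace++ wrapper)

    def run(self, text: Optional[str] = None, speech: Optional[bytes] = None) -> bytes:
        # Accept exactly one input modality: text or speech.
        if (text is None) == (speech is None):
            raise ValueError("provide exactly one of text or speech")
        # Assumption: speech input is transcribed to text first.
        if speech is not None:
            text = self.transcribe(speech)
        # Synthesize the audio track, then render video driven by that audio.
        audio = self.synthesize(text)
        return self.render(audio)
```

With stub stages such as `AvatarPipeline(transcribe=lambda a: a.decode(), synthesize=lambda t: t.encode(), render=lambda a: b"video:" + a)`, calling `run(text="hi")` exercises the text path and `run(speech=b"yo")` the speech path. Real wrappers around the speech services and the renderer would replace the lambdas.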