D-ID Launches Chat API to Give Generative AI Chatbots a Face and Voice
Synthetic media startup D-ID has introduced a chat API enabling conversations with virtual beings produced through its Creative Reality Studio. The chat API offers businesses a way to engage with customers that takes advantage of multiple forms of generative AI.
The new chat API augments D-ID’s text-to-video generative AI tools with a real-time streaming function. With it, the text responses of conversational large language model systems like ChatGPT can be delivered with a voice and a face. D-ID envisions the chat API producing hyperrealistic three-dimensional virtual assistants that look, sound, and respond like a human for sales, training, customer service, and other tasks.
“Previously [when] you generate a video, you need to wait for the video to be ready and then play. With the new real-time streaming capability, you can generate the video, and the video immediately starts streaming to the user and in parallel keeps on processing, exactly like when you’re watching a YouTube video,” D-ID vice president of product Yaniv Levi told Voicebot in an interview. “With the new real-time streaming capabilities, I really expect to see a revolution in everything related to customer service and customer experience because when people start using it, they’ll realize it’s like magic.”
D-ID has been rapidly incorporating generative AI into its services since launching the Creative Reality Studio a year ago. The platform enables customers to design video avatars based on uploaded photographs or on synthetic images produced by Stable Diffusion’s text-to-image engine. The avatar can perform a script written by the user or composed by OpenAI’s GPT-3 text generator. The chat API opens the door to real-time interactions using responses streamed from generative AI chatbots.
“Large language models like GPT-3 and LaMDA are changing the way we relate to and interact with technology, and we are not far off from all of us having our own personalized AI assistants and companions,” D-ID CEO Gil Perry said. “We are making tech more human by giving it a face and making the interaction more natural. I am very proud of D-ID, which continues to be at the cutting edge of the emergent generative AI industry.”