ID-ID Introduces V4 Expressive Visual Agents

New York, NY, March 16, 2026 – ID-ID, the leader in enterprise-grade AI avatar solutions, today announced the launch of V4 Expressive Visual Agents, a new generation of highly reliable digital agents designed for real-time conversations, linked by LLM, and written content for business enterprises.
Built on a new broadcast-based model and trained on performance taken from real actors, V4 Expressive Visual Agents deliver fast generation, low turnaround (less than 0.5 seconds) of conversations, and highly accurate lip syncing, up to 4K resolution, enabling expressive, natural interactions that scale reliably across business use cases.
Available today to 1500 business customers and millions of subscribers, V4 avatars are specially designed for low-latency delivery, making them ideal for real-time, chat experiences, and long-form content such as training modules, explainers, and multilingual instructional videos. To date, more than 800,000 virtual agents and 300 million interactive avatars have been created using previous D-ID models. At launch, V4 Expressive Visual Agents are available to users on all D-ID plans, starting at $5.90 per month, demonstrating the cost-effectiveness of the V4 AI model.
Research shows that human-like facial expressions improve information transfer, retention, and comprehension. As a result, businesses are increasingly adopting more reliable avatars for onboarding, training, customer engagement, and internal communications, especially where transparency, trust, and consistency are important.
V4 Expressive Visual Agents are the first high-quality avatars that display to dynamically align with selected emotions, ensuring that tone and intent match the underlying message. This allows the spoken content to be delivered clearly and confidently, with natural flow and emphasis. They are designed to act as a virtual layer for AI systems, enabling real-time, two-way interaction rather than one-way video playback. As LLM responds, the avatar automatically adapts facial expression and delivery based on context and emotion, so empathy looks empathetic, urgency feels urgent, and confidence reads like confidence. This makes both customer-facing and employee-facing agents more natural, reliable, and efficient.
V4 Expressive Visual Agents also add an optional camera layer that enables real-time emotion awareness, providing implicit cues to both LLM feedback and expressive avatar delivery, including tone and facial expression. In addition, V4 Expressive Visual Agents can display interactive UI features in-line during a conversation, share visuals such as images, charts, and video, as well as structured interactions such as forms and questions, enabled through D-ID’s MCP Apps.
Unlike short-form video production tools that are optimized for cinematic clips that last only seconds, V4 Avatars are designed for continuous, consistent output. Businesses can produce minutes or hours of video with a stable avatar identity, and conduct real-time conversations at scale, at a fraction of the price (70x cheaper than Google VEO 3 Fast), making it cost-effective for courses, presenters, multilingual training, and recurring content streams. These savings compounds when it comes to real-time interactions, costing cents per conversation when using D-ID.
“We have come a long way from our first models that delighted the world by turning static images into talking images,” said D-ID Co-founder and CEO Gil Perry. “Today, with V4, we’re setting a new benchmark for avatar reliability and performance while keeping it fast enough for real-time conversations and consistent, efficient and secure enough for business scale. This advancement in avatar technology positions D-ID at the forefront of providing the visual layer for the next wave of AI adoption as businesses seek to make communication more natural and human.”
After the acquisition of simpleshow in September 2025, D-ID expanded its business distribution area and integrated its AI avatar capabilities into the simpleshow company’s training and explanatory video ecosystem. Since then, D-ID’s ARR has grown 250%, reflecting the expansion of best-selling properties and growing business demand for AI-driven interactive video.
About D-ID
ID-ID is the world leader in artificial intelligence for video and digital people, enabling frictionless, real-time communication through the Real-Time Streaming API. Its technology powers life-like digital presenters, learning companions, and virtual assistants for Fortune 500 companies and mission-driven organizations alike. In September 2025, D-ID received a simple exhibition, a global pioneer in the creation of AI-based descriptor videos. Based in Berlin, Simple Show helps organizations in more than 70 countries simplify complex messages through intelligent, analytical, and human-centered video communications.



