ai agentsIntroducing VideoSDK Phone Numbers: Build AI Call Agents in 60 secondsToday, we’re launching VideoSDK Phone Numbers, a first-party telephony capability that lets you connect AI voice agents directly to the phone network.
ai agentsIntroducing the Ultravox Realtime Plugin in VideoSDKLearn more about building real-time voice agents with Ultravox and VideoSDK Agents, where listening, reasoning, and speaking happen together for ultra-low latency, natural conversations.
ai agentsIntroducing xAI Grok Real-Time Speech-to-Speech Plugin for VideoSDK AgentsBuild real-time voice and text agents with xAI’s Grok now natively integrated into VideoSDK Agents for multimodal, context-aware AI experiences.
ai agentsIntroducing the Nvidia Speech to Text Plugin in VideoSDKLearn how to integrate NVIDIA STT with the VideoSDK Agents SDK to generate fast, accurate, and production-ready transcriptions.
ai agentsIntroducing the MurfAI Text To Speech Plugin in VideoSDKLearn how to integrate Murf AI Text-to-Speech with VideoSDK Agents to generate natural, expressive, and low-latency voice output for AI agents.
ai agentsIntroducing the Nvidia Text to Speech Plugin in VideoSDKLearn how to integrate NVIDIA Riva TTS with the VideoSDK Agents SDK to deliver real-time, low-latency speech that makes AI voice agents sound natural, responsive, and production-ready.
pluginsIntroducing the Gladia Speech to Text Plugin in VideoSDKWe’re introducing the Gladia Speech-to-Text plugin for VideoSDK. With multilingual support, instant partial results, and handling of mixed languages, it provides a reliable speech input layer for voice-driven applications.
ai agentsIntroducing Testing and Evaluation in AI Voice AgentsLearn how to run testing and evaluation for AI voice agents using the VideoSDK Agent SDK, including STT, LLM, and TTS benchmarking, latency metrics, and LLM-based response judging.
ai agentsHow to Build an AI Voice System Using Real-Time Multi-Agent SwitchingIn this blog you'll learn about how to build an AI systems with multi-agent switching that intelligently transfer control between specialized agents. Keep conversations natural, tasks organized, and users engaged by letting each agent focus on what it does best.