Hub Chat Mode
Engage in text-based conversations with your AI agents while receiving both written responses and voice narration. Chat mode provides a comprehensive transcript, real-time status updates, and full access to agent options and settings.
Starting a Chat Session
Activation
Click the Chat button in the top-left corner of any Hub scene to enter chat mode. The system will establish a WebRTC connection and display a green dot when ready for interaction.
Connection Status Indicators
- Green Dot: Agent connected and ready for chat
- Amber Dot: System connecting, please wait
- Red Dot: Connection failed, click to retry or refresh page
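The indicator states above can be sketched as a small lookup. This is an illustration only: the state names follow the standard WebRTC `RTCPeerConnection.connectionState` values, but how Hub maps them to dot colors is an assumption, and `indicator_for` is a hypothetical helper rather than part of any Hub API.

```python
# Hypothetical mapping from WebRTC connection states to the status dot.
# The state names are the standard RTCPeerConnection.connectionState values;
# the mapping to colors is an assumption, not documented Hub behavior.
INDICATORS = {
    "new": "amber",
    "connecting": "amber",
    "disconnected": "amber",  # assumed: transient loss is shown as reconnecting
    "connected": "green",
    "failed": "red",
    "closed": "red",
}

# User-facing messages taken from the indicator list above.
MESSAGES = {
    "green": "Agent connected and ready for chat",
    "amber": "System connecting, please wait",
    "red": "Connection failed, click to retry or refresh page",
}


def indicator_for(state: str) -> tuple[str, str]:
    """Return (dot color, user-facing message) for a connection state."""
    # Unknown states are treated as still connecting.
    color = INDICATORS.get(state, "amber")
    return color, MESSAGES[color]
```

For example, `indicator_for("connected")` yields the green dot and its message, while `indicator_for("failed")` yields the red dot with the retry hint.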
Chat Interface Components
Top Controls
Left Side Controls:
- Chat Button: Current active mode (highlighted when selected)
- Talk Button: Switch to voice mode anytime during conversation
- Gear Icon: Access options for agent selection, voice settings, and LLM models
Chat Input Area
Bottom Center:
- Message Input Bar: Type your questions and messages
- Send Button: Submit messages to your agent
- Transcript Toggle: Show/hide conversation transcript on right side
Conversation Transcript
Right Side Panel:
- Message History: Complete conversation record with timestamps
- Agent Responses: Full text of agent replies
- User Messages: Your questions and inputs
- Timestamps: Precise timing for each message exchange
- Scrollable Interface: Review entire conversation history

Chat Interface Variations
Figure: Basic chat mode
Advanced Chat Features
Figure: Detailed agent responses
Real-Time Agent Actions Example
During conversations, agents can perform complex multi-step actions like sending emails, conducting research, and managing tasks. Here’s an example showing an agent handling email management:
Figure: Extended chat conversation showing an agent performing email tasks and providing detailed responses with timestamps
Interactive Features
Real-Time Status Display
Bottom Status Bar shows agent activity:
- Searching: Agent retrieving information from knowledge sources
- Web Searching: Performing live web searches for current information
- Tool Calling: Using connected applications via MCP workflows
- Memory Updates: Storing important conversation details
- Message Processing: Generating responses and formulating replies
Agent Response Format
Dual Response System:
- Text Display: Written responses appear in transcript
- Voice Narration: Agent speaks responses aloud simultaneously
- Rich Content: Support for formatted text, lists, and structured information
- Action Confirmations: Notifications when agent performs tasks
Smart Memory Integration
Automatic Information Storage:
- Personal Details: Names, preferences, contact information
- Task Management: Important tasks, deadlines, and follow-ups
- Context Retention: Key conversation points and decisions
- Session Continuity: Information carries across multiple conversations
Agent Options and Settings
Access comprehensive agent controls through the gear icon in the top-left corner.
Agent Selection
Agent Dropdown:
- Available Agents: All Worker and presenter agents appear in the dropdown menu
- Switch Anytime: Change agents without losing conversation context
- Agent-Specific: Each agent brings its own knowledge, presentation deck, and behavior
- Seamless Transition: Conversation continues with the new agent context
Voice Selection
Voice Options:
- Gemini voices: Sportsman, Customer support, Sarah, Brooke, Katie, Zemo, ajith, duaila, azj, ajz, sjl, brit, Swissen (shown when a Gemini LLM is active)
- OpenAI Realtime voices: Alloy, Echo, Shimmer, Ash, Ballad, Coral, Sage, Verse, Cedar, Marin (shown when an OpenAI Realtime LLM is active)
- Real-Time Switch: Voice changes take effect immediately
- WebRTC Reconnection: Brief reconnection when changing voice settings
LLM Model Selection
AI Model Options:
- OpenAI: GPT 4.1 mini, GPT 4.1, GPT 5, GPT 5 nano, GPT 5 mini
- Gemini: Gemini 2.5 Flash Lite, Gemini 2.5 Pro, Gemini 2.5 Flash
- OpenAI Realtime: GPT Realtime, GPT‑4o Realtime, GPT Realtime Mini
- Groq-hosted: GPT OSS 20B, GPT OSS 120B, Qwen3‑32B, Moonshotai Kimi K2
- Performance Impact: Different models affect response speed, reasoning depth, and voice availability
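Because voice availability depends on the active model family, the relationship can be sketched as a simple lookup. The voice names are copied from the Voice Options list; the function `voices_for` and the family labels it accepts are hypothetical, and the source does not say which voices (if any) the other model families expose, so the sketch returns an empty list for them.

```python
# Voice names copied from the Voice Options list; the lookup itself is a
# hypothetical sketch, not Hub's implementation.
GEMINI_VOICES = [
    "Sportsman", "Customer support", "Sarah", "Brooke", "Katie", "Zemo",
    "ajith", "duaila", "azj", "ajz", "sjl", "brit", "Swissen",
]
OPENAI_REALTIME_VOICES = [
    "Alloy", "Echo", "Shimmer", "Ash", "Ballad",
    "Coral", "Sage", "Verse", "Cedar", "Marin",
]


def voices_for(model_family: str) -> list[str]:
    """Return the voices shown when a model of the given family is active."""
    family = model_family.strip().lower()
    if family == "gemini":
        return list(GEMINI_VOICES)
    if family == "openai realtime":
        return list(OPENAI_REALTIME_VOICES)
    # Assumption: the source lists no voices for other families.
    return []
```

Returning a copy of each list keeps callers from accidentally mutating the shared constants.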
Generate Images and Videos
Use the chat input bar to create media in just a few steps:
Pick an Agent and Model
Confirm the active persona and LLM using the gear icon in the Hub. This choice also determines which voices are available for real-time narration.
Prompt an Image
In the bottom input bar, describe the image you want (e.g., “Create an image of a futuristic city skyline at night”) and press Enter.
Review the Result
The agent replies with a thumbnail preview and confirmation once the image is ready. Click the thumbnail to view the full asset.
Conversation History
Session Management:
- Previous Conversations: Access to past chat sessions
- Load History: Bring previous conversations into current session
- Context Integration: Historical context informs current responses
- Organized Records: Conversations organized by date and agent
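As a rough illustration of organizing records by date and agent, the sketch below groups conversation records under a `(date, agent)` key. The record shape (`started_at` as an ISO 8601 string, `agent` as a name) and the function `organize` are assumptions made for the example; the source does not describe how Hub stores or exposes conversation history.

```python
from collections import defaultdict
from datetime import datetime


def organize(conversations):
    """Group conversation records by (date, agent), sorted by that key.

    Each record is assumed to be a dict with an ISO 8601 'started_at'
    string and an 'agent' name. This shape is hypothetical.
    """
    groups = defaultdict(list)
    for conv in conversations:
        day = datetime.fromisoformat(conv["started_at"]).date().isoformat()
        groups[(day, conv["agent"])].append(conv)
    # Sort keys so the earliest date comes first, then agent name.
    return dict(sorted(groups.items()))


history = [
    {"started_at": "2025-01-06T14:00:00", "agent": "Sales", "title": "Pricing"},
    {"started_at": "2025-01-05T09:30:00", "agent": "Support", "title": "Login issue"},
    {"started_at": "2025-01-05T16:45:00", "agent": "Support", "title": "Follow-up"},
]
organized = organize(history)
```

The agent names and titles in `history` are made-up sample data.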
Advanced Chat Capabilities
Web Search Integration
When agents perform web searches:
- Real-Time Results: Search results appear as the agent finds information
- Source Citations: Clear attribution for web-sourced information
- Interactive Elements: In Web Search scene, results appear as 3D widgets
- Fact Verification: Agent can verify and cross-reference information
Tool Integration and Actions
Connected Application Access:
- MCP Tools: Integration with 200+ applications
- Email Management: Send emails through connected Gmail
- Calendar Operations: Schedule meetings and manage appointments
- CRM Updates: Update customer records and contact information
- Phone Integration: Make calls via Twilio integration
Multi-Modal Content
Rich Interaction Support:
- Website Integration: Display live websites in Zen scenes
- Presentation Control: Navigate slides in Presentation scenes
- Video Integration: Show YouTube videos while chatting
- Document Processing: Handle uploaded PDFs and presentations

Chat Session Management
Session Continuity
- 20-Minute Timeout: Sessions time out automatically after 20 minutes of inactivity
- Session Restart: Click the Chat button to restart after a timeout
- Context Preservation: Important information retained across sessions
- Page Refresh: Alternative method to restart stalled sessions
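The inactivity timeout described above can be modeled as a timestamp that resets on each activity. This is a minimal sketch under stated assumptions: `ChatSession`, `touch`, and `is_expired` are hypothetical names, and the source does not specify which events count as activity or how Hub tracks them internally.

```python
import time

SESSION_TIMEOUT_SECONDS = 20 * 60  # 20 minutes, per the list above


class ChatSession:
    """Hypothetical inactivity tracker; not part of any Hub API."""

    def __init__(self, now=time.monotonic):
        # The clock is injectable so the logic can be tested without waiting.
        self._now = now
        self._last_activity = now()

    def touch(self) -> None:
        """Record activity, restarting the inactivity window."""
        self._last_activity = self._now()

    def is_expired(self) -> bool:
        """Return True once the inactivity window has fully elapsed."""
        return self._now() - self._last_activity >= SESSION_TIMEOUT_SECONDS
```

Using a monotonic clock avoids false expirations when the system wall clock is adjusted.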
Mode Switching
Seamless Transitions:
- Chat to Talk: Click the Talk button to switch to voice mode
- Context Retention: Conversation continues without interruption
- Settings Preservation: Agent, voice, and model selections maintained
- Real-Time Switch: Immediate transition between interaction modes
Performance Optimization
Optimal Chat Experience:
- Stable Internet: Ensure reliable connection for WebRTC performance
- Browser Permissions: Grant audio permissions even in chat mode, since responses are also narrated by voice
- Regular Updates: Keep browser updated for best compatibility
- Memory Management: Clear browser cache if experiencing slowdowns
Best Practices
Effective Chat Communication
- Clear Questions: Ask specific, well-formed questions
- Context Provision: Provide relevant background information
- Follow-Up Questions: Build on agent responses for deeper information
- Task Specification: Be specific about desired actions or outcomes
Feature Utilization
- Transcript Review: Use transcript to track important information
- Agent Switching: Try different agents for varied perspectives
- Tool Integration: Leverage connected apps for enhanced functionality
- History Access: Review previous conversations for context
Troubleshooting
- Connection Issues: Check for the green dot before starting a conversation
- Response Delays: Allow time for complex searches and tool operations
- Missing Transcript: Show the transcript using the Transcript Toggle in the input bar
- Audio Problems: Check browser audio permissions; chat mode is primarily text-based but still narrates responses aloud
Integration with Other Features
Scene Compatibility
- All Scenes: Chat mode works in every available scene
- Interactive Widgets: Enhanced experience in Zen and Web Search scenes
- Presentation Integration: Navigate presentations via chat commands
- Video Wall: Chat while custom videos play in background
Avatar Consistency
- Avatar Switching: Change avatars without losing chat context
- Voice Matching: Avatar voice settings apply to chat narration
- Character Persistence: Avatar personality maintained throughout chat
