Skip to main content

Hub Chat Mode

Engage in text-based conversations with your AI agents while receiving both written responses and voice narration. Chat mode provides a comprehensive transcript, real-time status updates, and full access to agent options and settings.

Starting a Chat Session

Activation

Click the Chat button in the top-left corner of any Hub scene to enter chat mode. The system will establish a WebRTC connection and display a green dot when ready for interaction.

Connection Status Indicators

  • Green Dot: Agent connected and ready for chat
  • Amber Dot: System connecting, please wait
  • Red Dot: Connection failed, click to retry or refresh page

Chat Interface Components

Top Controls

Left Side Controls:
  • Chat Button: Current active mode (highlighted when selected)
  • Talk Button: Switch to voice mode anytime during conversation
  • Gear Icon: Access options for agent selection, voice settings, and LLM models

Chat Input Area

Bottom Center:
  • Message Input Bar: Type your questions and messages
  • Send Button: Submit messages to your agent
  • Transcript Toggle: Show/hide conversation transcript on right side

Conversation Transcript

Right Side Panel:
  • Message History: Complete conversation record with timestamps
  • Agent Responses: Full text of agent replies
  • User Messages: Your questions and inputs
  • Time Stamps: Precise timing for each message exchange
  • Scrollable Interface: Review entire conversation history
Chat Interface with Transcript Chat interface showing conversation transcript, agent options, and real-time status indicators

Chat Interface Variations

Basic Chat Mode: Chat Basic Interface Clean chat interface with message input and avatar display Chat with Transcript View: Chat Transcript View Chat mode showing expanded transcript panel with conversation history and timestamps Extended Conversation Management: Chat Extended Conversation Extended chat session showing multi-turn conversations with comprehensive agent responses

Advanced Chat Features

Detailed Agent Responses: Chat Detailed Responses Chat interface displaying comprehensive agent responses with rich formatting and detailed explanations Task Management Integration: Chat Task Management Chat conversation showing agent’s ability to handle complex task management and provide structured responses

Real-Time Agent Actions Example

During conversations, agents can perform complex multi-step actions like sending emails, conducting research, and managing tasks. Here’s an example showing an agent handling email management: Chat Conversation Example Extended chat conversation showing agent performing email tasks and providing detailed responses with timestamps

Interactive Features

Real-Time Status Display

Bottom Status Bar shows agent activity:
  • Searching: Agent retrieving information from knowledge sources
  • Web Searching: Performing live web searches for current information
  • Tool Calling: Using connected applications via MCP workflows
  • Memory Updates: Storing important conversation details
  • Message Processing: Generating responses and formulating replies

Agent Response Format

Dual Response System:
  • Text Display: Written responses appear in transcript
  • Voice Narration: Agent speaks responses aloud simultaneously
  • Rich Content: Support for formatted text, lists, and structured information
  • Action Confirmations: Notifications when agent performs tasks

Smart Memory Integration

Automatic Information Storage:
  • Personal Details: Names, preferences, contact information
  • Task Management: Important tasks, deadlines, and follow-ups
  • Context Retention: Key conversation points and decisions
  • Session Continuity: Information carries across multiple conversations

Agent Options and Settings

Access comprehensive agent controls through the gear icon in the top-left corner.

Agent Selection

Agent Dropdown:
  • Available Agents: All Worker and presenter agents appear in the dropdown menu
  • Switch Anytime: Change agents without losing conversation context
  • Agent-Specific: Each agent brings its own knowledge, presentation deck, and behavior
  • Seamless Transition: Conversation continues with the new agent context

Voice Selection

Voice Options:
  • Gemini voices: Sportsman, Customer support, Sarah, Brooke, Katie, Zemo, ajith, duaila, azj, ajz, sjl, brit, Swissen (shown when a Gemini LLM is active)
  • OpenAI Realtime voices: Alloy, Echo, Shimmer, Ash, Ballad, Coral, Sage, Verse, Cedar, Marin (shown when an OpenAI Realtime LLM is active)
  • Real-Time Switch: Voice changes take effect immediately
  • WebRTC Reconnection: Brief reconnection when changing voice settings

LLM Model Selection

AI Model Options:
  • OpenAI: GPT 4.1 mini, GPT 4.1, GPT 5, GPT 5 nano, GPT 5 mini
  • Gemini: Gemini 2.5 Flash Lite, Gemini 2.5 Pro, Gemini 2.5 Flash
  • OpenAI Realtime: GPT Realtime, GPT‑4o Realtime, GPT Realtime Mini
  • Groq hosted: GPT OSS 20B, GPT OSS 120B, Qwen3‑32B, Moonshotai Kimi K2
  • Performance Impact: Different models affect response speed, reasoning depth, and voice availability

Generate Images and Videos

Use the chat input bar to create media in just a few steps:
1

Pick an Agent and Model

Confirm the active persona and LLM using the gear icon in the Hub. This also sets the available voice for realtime commentary.
2

Prompt an Image

In the bottom input bar, describe the image you want (e.g., “Create an image of a futuristic city skyline at night”) and press Enter.
3

Review the Result

The agent replies with a thumbnail preview and confirmation once the image is ready. Click the thumbnail to view the full asset.
4

Transform into Video

Submit a follow-up prompt such as “Generate a short video based on this image.” The processed clip appears in the chat with an inline player.
Every generated asset is stored automatically in AI Drive. Visit Integrations → AI Drive to browse, filter, download, or share your media later.

Conversation History

Session Management:
  • Previous Conversations: Access to past chat sessions
  • Load History: Bring previous conversations into current session
  • Context Integration: Historical context informs current responses
  • Organized Records: Conversations organized by date and agent

Advanced Chat Capabilities

Web Search Integration

When agents perform web searches:
  • Real-Time Results: Search results appear as agent finds information
  • Source Citations: Clear attribution for web-sourced information
  • Interactive Elements: In Web Search scene, results appear as 3D widgets
  • Fact Verification: Agent can verify and cross-reference information

Tool Integration and Actions

Connected Application Access:
  • MCP Tools: Integration with 200+ applications
  • Email Management: Send emails through connected Gmail
  • Calendar Operations: Schedule meetings and manage appointments
  • CRM Updates: Update customer records and contact information
  • Phone Integration: Make calls via Twilio integration

Multi-Modal Content

Rich Interaction Support:
  • Website Integration: Display live websites in Zen scenes
  • Presentation Control: Navigate slides in Presentation scenes
  • Video Integration: Show YouTube videos while chatting
  • Document Processing: Handle uploaded PDFs and presentations
Website Integration Chat mode with integrated website display showing SEO webinar content alongside avatar interaction in Zen scene

Chat Session Management

Session Continuity

  • 20-Minute Timeout: Sessions automatically timeout after inactivity
  • Session Restart: Click Chat button to restart after timeout
  • Context Preservation: Important information retained across sessions
  • Page Refresh: Alternative method to restart stalled sessions

Mode Switching

Seamless Transitions:
  • Chat to Talk: Click Talk button to switch to voice mode
  • Context Retention: Conversation continues without interruption
  • Settings Preservation: Agent, voice, and model selections maintained
  • Real-Time Switch: Immediate transition between interaction modes

Performance Optimization

Optimal Chat Experience:
  • Stable Internet: Ensure reliable connection for WebRTC performance
  • Browser Permissions: Grant necessary audio permissions even for chat
  • Regular Updates: Keep browser updated for best compatibility
  • Memory Management: Clear browser cache if experiencing slowdowns

Best Practices

Effective Chat Communication

  • Clear Questions: Ask specific, well-formed questions
  • Context Provision: Provide relevant background information
  • Follow-Up Questions: Build on agent responses for deeper information
  • Task Specification: Be specific about desired actions or outcomes

Feature Utilization

  • Transcript Review: Use transcript to track important information
  • Agent Switching: Try different agents for varied perspectives
  • Tool Integration: Leverage connected apps for enhanced functionality
  • History Access: Review previous conversations for context

Troubleshooting

  • Connection Issues: Check for green dot before starting conversations
  • Response Delays: Allow time for complex searches and tool operations
  • Missing Transcript: Toggle transcript visibility using input bar control
  • Audio Problems: Check browser permissions even though primarily text-based

Integration with Other Features

Scene Compatibility

  • All Scenes: Chat mode works in every available scene
  • Interactive Widgets: Enhanced experience in Zen and Web Search scenes
  • Presentation Integration: Navigate presentations via chat commands
  • Video Wall: Chat while custom videos play in background

Avatar Consistency

  • Avatar Switching: Change avatars without losing chat context
  • Voice Matching: Avatar voice settings apply to chat narration
  • Character Persistence: Avatar personality maintained throughout chat
Ready to start chatting with your AI agents? Click the Chat button and begin your text-based conversation with intelligent voice narration!