VoiceVision - AI Assistant with Live Camera (Vision AI)

How to create an AI assistant with a live camera feed


How to Use Live Camera Feed in Your AI Assistant

In this tutorial, we’ll guide you through integrating a live camera feed into your AI assistant. This feature enables real-time vision analysis, so the assistant can process visual input from a live camera. It can be used in applications such as:

  • Medical screening (e.g., dermatologist AI assistants)
  • Employee onboarding
  • Interview sessions
  • Security verification
  • Interactive kiosk systems

With a live camera feed integrated, your AI assistant can analyze images in real time and provide instant feedback or recommendations.

Prerequisites

Before enabling this feature, ensure that:

  • You have the Real-Time AI Voice Assistant enabled in your settings.
  • You have a structured instruction prompt for the workflow.
  • You have configured client-side function calls for image capture and analysis.

Now, let’s get started!

Step 1: Configure Your AI Assistant

Go to the Assistant Page

  • Select the assistant you want to enable the live camera feed for.
  • Click on the Settings icon.
  • Use a sample instruction prompt such as the following:
Act as a help desk expert for Smiley Skin Care. Greet users with the welcome message and guide them through the appointment booking process with a dermatologist. Ask relevant questions and conduct a screening process while maintaining a natural and helpful conversation. Follow the provided chatbot flow to structure your responses without explicitly mentioning the steps. If a step includes a function_call, execute the corresponding OpenAI function as needed.
 
{
  "flows": [
    {
      "name": "dermatologist appointment booking",
      "onboard_message": "Hey there! Welcome to Smiley Skin Care. Let's get your appointment booked! 😊",
      "steps": [
        {
          "id": "step_1",
          "message": "Let's start the screening process now. Take a top-down photo of your head, ensuring the crown area is clearly visible. Confirm when you are ready.",
          "function_call": "capture_frame_top_view",
          "input_name": "capture_frame_top_view",
          "next": "step_2"
        },
        {
          "id": "step_2",
          "message": "Let me capture a clear image of your frontal hairline. Confirm when you are ready.",
          "function_call": "capture_frame_frontal_hairline",
          "input_name": "capture_frame_frontal_hairline",
          "next": "step_3"
        },
        {
          "id": "step_3",
          "message": "I’d love to learn more about you! What’s your name?",
          "input_name": "user_name",
          "next": "step_4"
        },
        {
          "id": "step_4",
          "message": "Can you share your mobile number? 📱",
          "input_name": "mobile_number",
          "next": "step_5"
        },
        {
          "id": "step_5",
          "message": "Got it! Now, could you tell us your age and gender?",
          "input_name": "age_gender",
          "next": "step_6"
        },
        {
          "id": "step_6",
          "message": "What hair concerns are you facing? We're here to assist!",
          "input_name": "hair_problem",
          "function_call": "capture_result_top_view",
          "next": "step_7"
        },
        {
          "id": "step_7",
          "message": "Please confirm your appointment date and time.",
          "input_name": "appointment_datetime",
          "function_call": "capture_result_frontal_hairline",
          "next": "step_8"
        },
        {
          "id": "step_8",
          "message": "Here are your screening results: {capture_frame_frontal_hairline}, {capture_frame_top_view}",
          "next": "step_9"
        },
        {
          "id": "step_9",
          "message": "Here's your appointment summary: \n\nName: {user_name} \nMobile: {mobile_number} \nAge & Gender: {age_gender} \nHair Concern: {hair_problem} \nHair Top view: {capture_frame_top_view} \nHair Front view: {capture_frame_frontal_hairline} \nPreferred Date & Time: {appointment_datetime} \n\nIs everything correct?",
          "input_name": "confirm_appointment",
          "options": [
            {
              "text": "Yes, confirm it! ✅",
              "next": "step_10"
            },
            {
              "text": "No, let's start over 🔄",
              "next": "step_1"
            }
          ]
        },
        {
          "id": "step_10",
          "message": "Awesome! Booking your appointment now... ⏳",
          "function_call": "register_appointment",
          "next": "step_11"
        },
        {
          "id": "step_11",
          "message": "You're all set! 🎉 Your appointment is confirmed. See you soon at Smiley Skin Care!"
        }
      ]
    }
  ]
}
  • The instruction prompt should include a structured workflow whose steps name the functions to call (the function_call fields in the sample above).
  • Ensure that media vision is disabled before proceeding.
  • Click Save to apply the changes.
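Before saving, it can help to check the flow definition offline. The following is a minimal sketch in Python, not part of the platform: the helper name validate_flow is hypothetical, and it assumes the flow uses only the fields shown in the sample (id, message, input_name, next, options). It reports JSON syntax errors, next targets that point to a step that does not exist, and {placeholders} in a message that do not match any input_name.

```python
import json
import re
from typing import List


def validate_flow(flow_json: str) -> List[str]:
    """Return a list of problems found in a flow definition (empty if none)."""
    try:
        data = json.loads(flow_json)
    except json.JSONDecodeError as exc:
        return [f"invalid JSON at line {exc.lineno}, column {exc.colno}: {exc.msg}"]

    problems = []
    for flow in data.get("flows", []):
        steps = flow.get("steps", [])
        ids = {step["id"] for step in steps}
        # Names that a message placeholder such as {user_name} may refer to.
        inputs = {step["input_name"] for step in steps if "input_name" in step}
        for step in steps:
            # Collect the step's own "next" plus the "next" of each option.
            targets = [step.get("next")]
            targets += [opt.get("next") for opt in step.get("options", [])]
            for target in targets:
                if target is not None and target not in ids:
                    problems.append(
                        f"{step['id']}: next points to unknown step '{target}'"
                    )
            for name in re.findall(r"\{(\w+)\}", step.get("message", "")):
                if name not in inputs:
                    problems.append(
                        f"{step['id']}: placeholder '{{{name}}}' has no matching input_name"
                    )
    return problems
```

To use it, paste the flow JSON into a string (or read it from a file) and print each entry of validate_flow(text). An empty list means none of these three checks failed; it does not guarantee that the assistant will accept the prompt.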

Step 2: Enable Live Camera Feed

  • Go to Advanced Settings
  • Ensure that Real-Time AI Voice Assistant is enabled.
  • Enable Client-Side Function Calls
  • Define the functions that capture images for Vision AI processing in JSON, for example:
{
  "tools": [
    {
      "type": "function",
      "name": "capture_frame_top_view",
      "description": "Analyze the top-view image from the hair fall screening process",
      "parameters": {
        "type": "object",
        "properties": {
          "prompt": {
            "type": "string",
            "default": "screening process for hair fall assessment: Analyze this top-view image from the hair fall screening process. Ensure your response is in text format and does not exceed 30 words."
          }
        }
      }
    },
    {
      "type": "function",
      "name": "capture_result_top_view",
      "description": "Analyze result of the top-view image from the hair fall screening process",
      "parameters": {}
    },
    {
      "type": "function",
      "name": "capture_frame_frontal_hairline",
      "description": "Analyze front-view image from the hair fall screening process",
      "parameters": {
        "type": "object",
        "properties": {
          "prompt": {
            "type": "string",
            "default": "screening process for hair fall assessment: Analyze this frontal hairline image from the hair fall screening process. Ensure your response is in text format and does not exceed 30 words."
          }
        }
      }
    },
    {
      "type": "function",
      "name": "capture_result_frontal_hairline",
      "description": "Analyze result of the frontal hairline image from the hair fall screening process",
      "parameters": {}
    },
    {
      "type": "function",
      "name": "register_appointment",
      "description": "register appointment",
      "parameters": {
        "type": "object",
        "properties": {
          "user_name": {
            "type": "string",
            "description": "user name"
          },
          "mobile_number": {
            "type": "string",
            "description": "user mobile number"
          },
          "age_gender": {
            "type": "string",
            "description": "Age and gender"
          },
          "hair_problem": {
            "type": "string",
            "description": "hair problem description"
          },
          "appointment_datetime": {
            "type": "string",
            "description": "Appointment datetime"
          }
        }
      }
    }
  ]
}
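Every function_call named in the instruction prompt needs a matching entry in this tools list. As a rough illustration, the Python sketch below compares the two. The helper name missing_tools is hypothetical, and the sketch assumes both documents have already been parsed into dictionaries with the shapes shown in this tutorial.

```python
def missing_tools(flow: dict, tools: dict) -> set:
    """Return function_call names used in the flow that no tool defines."""
    defined = {tool["name"] for tool in tools.get("tools", [])}
    called = {
        step["function_call"]
        for f in flow.get("flows", [])
        for step in f.get("steps", [])
        if "function_call" in step
    }
    return called - defined


# Tiny example with the same shape as the samples in this tutorial.
flow = {
    "flows": [
        {
            "steps": [
                {"id": "step_1", "function_call": "capture_frame_top_view"},
                {"id": "step_2", "function_call": "register_appointment"},
            ]
        }
    ]
}
tools = {"tools": [{"type": "function", "name": "capture_frame_top_view"}]}

print(missing_tools(flow, tools))  # the set of names that still need a tool entry
```

An empty set means every function call in the flow has a definition. Tools that are defined but never called are not reported by this check.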
 

To capture images from the live camera feed, use function names starting with:

  • "capture_frame_" for image capture
  • "capture_result_" for retrieving analysis results
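The naming convention can be illustrated with a short Python sketch. This is not the platform's dispatch logic, which this tutorial does not describe. It only shows how a name splits into a kind and a view, so that a capture function and its result function can be paired by their shared suffix. The function name classify and the labels "capture", "result", and "other" are made up for the example.

```python
from typing import Optional, Tuple

CAPTURE_PREFIX = "capture_frame_"
RESULT_PREFIX = "capture_result_"


def classify(name: str) -> Tuple[str, Optional[str]]:
    """Split a function name into (kind, view) based on its prefix."""
    if name.startswith(CAPTURE_PREFIX):
        return "capture", name[len(CAPTURE_PREFIX):]
    if name.startswith(RESULT_PREFIX):
        return "result", name[len(RESULT_PREFIX):]
    # Any other function, such as register_appointment, is not camera-related.
    return "other", None


for fn in ("capture_frame_top_view", "capture_result_top_view", "register_appointment"):
    print(fn, "->", classify(fn))
```

In the sample above, capture_frame_top_view and capture_result_top_view share the suffix top_view, which is what ties the captured image to its analysis result.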

Enable Camera Access

Enabling camera access activates the live camera feed for real-time analysis.

Click Save to apply your changes.

Step 3: Test Your AI Assistant with Live Camera Feed

Now that your AI assistant is set up with a live camera feed, it’s time to test it!

  • Go to the Assistant Page
  • Click the Chat Icon to start an interaction.
  • Activate the Camera Feed to analyze real-time images.
  • Observe the AI assistant's response to the captured images.
  • For this tutorial, we simulated a dermatologist screening process in which the AI assistant assesses hair fall from the captured top-view and frontal-hairline images.