Speaker-Aware Voice Assistant

Build an AI Voice Assistant with Speaker Recognition for Personalization and Authentication

Identifies who is speaking by voiceprint, personalizes responses, and grants or denies voice commands based on the speaker's role. Built for enterprises developing their own equivalent of Alexa Voice ID or Google Voice Match.

Platforms supported
Android, iOS, Linux, macOS, Windows, Chrome, Edge, Firefox, Safari, Raspberry Pi
How the AI voice assistant with speaker recognition works

One on-device voice AI pipeline for speaker identification, voice commands, and personalized responses

An AI voice assistant with speaker recognition listens for a wake word, infers the user's intent directly from the follow-on spoken command, identifies who is speaking by voiceprint to grant or deny access based on the speaker's role, and responds to users all on-device with no cloud dependency. Porcupine Wake Word detects the wake word. Rhino Speech-to-Intent extracts the command as a structured intent. Eagle Speaker Recognition compares the speaker's voice against enrolled voiceprints and returns a similarity score. The assistant then checks the speaker's role (admin or user) against the command's permission level before acting. Orca Text-to-Speech speaks the response back.

[Pipeline diagram] User speaks: "Hey Assistant, unlock the front door" → Porcupine (wake word detected) → Rhino (intent extracted: { intent: "adminOnly", action: "unlockDoor" }) → Eagle (voiceprint matched: speaker "Alex", role "admin", score 0.92) → Orca (voice response: "Admin command approved.") → returns to listening for the wake word.
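The access decision at the end of this pipeline can be sketched in a few lines of Python. The stub inputs below stand in for the real SDK outputs (Eagle similarity scores and a Rhino intent name); ADMIN_THRESHOLD, ENROLLED, and PERMISSIONS are illustrative names, not part of any Picovoice SDK:

```python
ADMIN_THRESHOLD = 0.8  # minimum similarity score to accept an identity (illustrative)

ENROLLED = [("Alex", "admin"), ("Jane", "user")]  # one (name, role) per enrolled profile

PERMISSIONS = {  # intent name -> role required to run it
    "adminOnly": "admin",
    "speakerPersonalized": "user",
    "generic": None,  # anyone may run it, even unidentified speakers
}

def identify_speaker(scores):
    """Map similarity scores (one per enrolled profile) to a (name, role) pair."""
    best = max(range(len(scores)), key=lambda i: scores[i])
    if scores[best] >= ADMIN_THRESHOLD:
        return ENROLLED[best]
    return (None, None)

def handle_command(intent, scores):
    """Decide whether the identified speaker may execute the inferred intent."""
    name, role = identify_speaker(scores)
    required = PERMISSIONS.get(intent)
    if required is None:
        return "Command approved."
    if role == "admin" or role == required:
        return f"{role.capitalize()} command approved."
    return "Permission denied."

print(handle_command("adminOnly", [0.92, 0.10]))  # Alex (admin) -> Admin command approved.
print(handle_command("adminOnly", [0.15, 0.85]))  # Jane (user) -> Permission denied.
```

In the real recipe, the scores come from Eagle and the intent from Rhino; the routing logic stays this simple because Rhino returns a structured intent rather than a transcript.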
Why Porcupine Wake Word?

Always-on, low-power wake word detection for embedded devices

3.8%
Single-Core CPU Utilization on Raspberry Pi 3
97.1%
Accuracy at 1 false alarm per 10 hours
~250K
Custom wake words trained and deployed in 2025

Porcupine Wake Word provides always-on, low-power wake word detection for embedded devices. It listens continuously with minimal CPU usage and triggers the command pipeline only when the wake word is detected, keeping power consumption low between activations. Custom wake words can be trained in seconds using the Picovoice Console and exported for embedded, mobile, web, and desktop applications.

Wake Word Detection Accuracy - higher is better
Porcupine: 97.1%
Snowboy: 68%
PocketSphinx: 52%
CPU Utilization - lower is better
Porcupine: 3.8%
Snowboy: 24.8%
PocketSphinx: 31.8%
Why Rhino Speech-to-Intent?

Structured voice commands without intermediate speech-to-text

6x
Higher accuracy than Big Tech average
97.3%
Accuracy tested across 6 to 24 dB Signal-to-Noise Ratio
Unlimited voice interactions per user

Rhino Speech-to-Intent extracts structured intents directly from spoken commands without an intermediate speech-to-text step, achieving higher accuracy than Big Tech alternatives such as Google Dialogflow and Amazon Lex. For a speaker-aware voice assistant, Rhino's domain-specific models define which commands exist and what permission levels they require. You configure the intents (e.g., adminOnly, speakerPersonalized, generic) and slots in the Picovoice Console. Rhino only recognizes commands within the defined domain, eliminating out-of-domain misrecognition.
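The Rhino Python SDK returns an inference object exposing is_understood, intent, and slots. The namedtuple below is a hypothetical stand-in for that object, just to show what the application code consumes instead of a transcript:

```python
from collections import namedtuple

# Hypothetical stand-in for the inference object returned by the Rhino SDK;
# the real object exposes the same three fields.
Inference = namedtuple("Inference", ["is_understood", "intent", "slots"])

def describe(inference):
    """Turn a structured intent into an action string; no transcript involved."""
    if not inference.is_understood:
        return "out-of-domain"  # Rhino rejects speech outside the configured context
    slot_text = ", ".join(f"{k}={v}" for k, v in sorted(inference.slots.items()))
    return f"{inference.intent}({slot_text})"

print(describe(Inference(True, "adminOnly", {"action": "unlockDoor"})))
# -> adminOnly(action=unlockDoor)
print(describe(Inference(False, None, {})))
# -> out-of-domain
```

Because out-of-domain speech is rejected before any handler runs, the permission check only ever sees intents you defined in the Console.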

Voice Command Acceptance Accuracy
Higher is better
Rhino: 97.3%
Amazon Lex: 84.3%
Google Dialogflow: 77.3%
Voice Command Acceptance Accuracy at 21 dB SNR
Higher is better
Rhino: 99%
Amazon Lex: 87%
Google Dialogflow: 83%
Why Eagle Speaker Recognition?

On-device voiceprint matching for identity and role verification

0.18%
Equal Error Rate vs. SpeechBrain 0.49%
4.5 MB
Model Size vs. SpeechBrain 46.5 MB
Any
No language or passphrase restriction

Eagle Speaker Recognition identifies who is speaking by comparing the speaker's voice against enrolled voiceprints. Eagle returns a similarity score for each enrolled speaker, and the application uses that score to determine identity and role. Enrollment takes a few seconds of speech. Eagle runs entirely on-device; voiceprint data is never transmitted to Picovoice or any third-party server.
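One way the application can turn Eagle's per-profile similarity scores into an identity decision is sketched below. The threshold and margin parameters are illustrative tuning knobs added here, not SDK parameters; requiring a margin over the runner-up is one optional way to avoid confusing similar voices:

```python
def identify(scores, names, threshold=0.7, margin=0.1):
    """Pick the enrolled speaker whose similarity score wins clearly.

    The top score must exceed `threshold` and beat the runner-up by `margin`;
    otherwise the speaker is treated as unknown.
    """
    ranked = sorted(zip(scores, names), reverse=True)
    top, name = ranked[0]
    runner_up = ranked[1][0] if len(ranked) > 1 else 0.0
    if top >= threshold and top - runner_up >= margin:
        return name
    return None

print(identify([0.92, 0.31], ["Alex", "Jane"]))  # Alex
print(identify([0.55, 0.52], ["Alex", "Jane"]))  # None: below threshold and too close to call
```

Raising the threshold favors security (fewer false accepts); lowering it favors convenience (fewer false rejects).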

Equal Error Rate (EER) — lower is better
Eagle Speaker Recognition: 0.18%
SpeechBrain Speaker Recognition: 0.49%
pyannote Speaker Recognition: 0.70%
Model Size (MB to initialize) — lower is better
Eagle Speaker Recognition: 4.5 MB
SpeechBrain Speaker Recognition: 46.5 MB
pyannote Speaker Recognition: 117.5 MB
Why Orca Text-to-Speech?

Spoken responses at 29 MB peak memory

29 MB
Peak Memory Usage
130 ms
First-token-to-speech latency
7 MB
Model Size

Orca Text-to-Speech speaks the assistant's responses back to the user after the access decision, such as "Hi Jane, playing your favourite playlist," "Admin command approved," or "Permission denied." Orca's 29 MB peak memory usage and 7 MB model size make it well-suited for mobile and embedded applications.
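The text Orca synthesizes is composed by the application. A minimal sketch of that step, using the response strings quoted above (the build_response function and its arguments are illustrative, not part of the Orca SDK):

```python
def build_response(decision, speaker=None, intent=None):
    """Compose the text the TTS engine will speak after the access decision."""
    if decision == "denied":
        return "Permission denied."
    if intent == "speakerPersonalized" and speaker:
        return f"Hi {speaker}, playing your favourite playlist."
    if intent == "adminOnly":
        return "Admin command approved."
    return "Command approved."

print(build_response("approved", speaker="Jane", intent="speakerPersonalized"))
# -> Hi Jane, playing your favourite playlist.
```

In the recipe, the returned string is passed to Orca for on-device synthesis and played back through the speaker.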

TTS Latency
Lower is better
Orca TTS Streaming: 128 ms
ElevenLabs TTS Streaming: 335 ms
ESpeak TTS: 1,430 ms
ElevenLabs TTS: 1,470 ms
Audio Quality
Listen and compare audio samples, grouped by peak memory usage. Under 30 MB peak memory: ESpeak and Orca.
Speaker-aware voice assistant use cases

From smart homes to clinical workflows: voice authentication and personalization for real devices

Smart Home

Smart home access control

Smart home manufacturers can restrict sensitive commands to enrolled household members. "Disarm the alarm" only works for verified adults, while children and guests can change the music and temperature without access to security or purchase functions.

Enterprise

Role-based voice commands on shared enterprise equipment

Enterprises deploying shared kiosks, terminals, or workstations can assign different permission levels to different speakers. A floor manager can override production settings; an operator can only start and stop cycles. The device identifies the speaker by voiceprint in real time.

Healthcare

Voice-authenticated access to clinical systems

Only authorized clinicians can issue medication commands, adjust device settings, or access patient information by voice, giving clinicians hands-free authentication during procedures and consultations. All speaker recognition runs on the device, keeping voiceprint data off external servers.

Retail

Role-based voice commands at retail stores

Retailers can add voice commands to POS terminals and self-checkout kiosks where managers, store associates, and customers have different access levels. Voiceprints of employees are stored locally and never sent to Picovoice or any third party.

Get started

AI voice assistant with speaker recognition: Code example

A complete working recipe in Python. Open-source on GitHub. Runs 100% on-device.

recipe · ai-voice-assistant-speaker-recognition
Difficulty
Beginner
Runtime
100% on-device
Language
Python
Platforms supported
Android, iOS, Linux, macOS, Windows, Chrome, Edge, Firefox, Safari, Raspberry Pi

Prerequisites

A Picovoice AccessKey from Picovoice Console and a local clone of the GitHub repo.

Usage

These instructions assume your current working directory is recipes/speaker-aware-voice-assistant/python.
1. Create a virtual environment

Isolate the recipe's dependencies from your system Python by setting up a virtual environment.

2. Activate the virtual environment

Activation makes pip install into .venv instead of system Python. The activation command differs between Linux/macOS/Raspberry Pi and Windows.

3. Install dependencies

Pulls in the Porcupine Wake Word, Rhino Speech-to-Intent, Eagle Speaker Recognition, and Orca Text-to-Speech Python SDKs, along with PvRecorder and PvSpeaker for audio input and output.

4. Train a wake word

In Picovoice Console, go to Porcupine Wake Word, enter your wake phrase, and download the .ppn model file optimized for your platform.

5. Design your voice commands

In Picovoice Console, go to Rhino Speech-to-Intent, create a context with intents like adminOnly, speakerPersonalized, and generic. Export the .rhn context file.

6. Enroll speakers

Create one Eagle speaker profile for each user you want the assistant to recognize.

7. Run the voice assistant

Pass your AccessKey, the wake word model, the Rhino context, and the speaker profiles with their assigned roles.
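On Linux, macOS, or Raspberry Pi, steps 1 through 3 look roughly like the following. The PyPI package names are the Picovoice SDKs; adjust the activation path for Windows. The recipe's exact run command and flags are in the GitHub repo, so they are not guessed at here:

```shell
# Step 1: create a virtual environment in .venv
python3 -m venv .venv

# Step 2: activate it (Windows: .venv\Scripts\activate)
. .venv/bin/activate

# Step 3: install the Picovoice SDKs plus audio I/O helpers
pip install pvporcupine pvrhino pveagle pvorca pvrecorder pvspeaker
```

After activation, `pip` and `python` resolve to the versions inside .venv, so the installed SDKs do not touch your system Python.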
Have questions or looking for implementations in other languages? Visit the GitHub pico-cookbook Speaker-Aware Voice Assistant Recipe, where you can find the open-source demo code and create an issue for demo-related technical questions.
Frequently asked questions


What is a speaker-aware voice assistant?
A speaker-aware voice assistant identifies who is speaking by voiceprint and adapts its behavior based on the speaker's identity and role. It uses voice biometrics to grant or deny access to specific commands, personalize responses, or restrict functionality to authorized users.
How does voice biometric authentication work in this recipe?
Eagle Speaker Recognition compares the speaker's voice against enrolled voiceprints and returns a similarity score. If the score exceeds a configurable threshold, the speaker is identified, and their assigned role (admin or user) determines which commands they can execute. All processing runs on-device.
How is this different from Alexa Voice ID, Google Voice Match, or Siri voice recognition?
Alexa Voice ID, Google Voice Match, and Siri all support some form of speaker recognition, but they are closed platforms tied to their respective ecosystems. For example, you cannot embed Alexa Voice ID into your own hardware product without joining Amazon's ecosystem, and all voice data flows through their cloud. Picovoice's Eagle Speaker Recognition is a licensable SDK that runs entirely on your hardware. Voiceprint enrollment and recognition happen on-device. No audio or voiceprint data is transmitted to Picovoice, Amazon, Google, Apple, or any third-party server. You control the hardware, the firmware, and the data.
How accurate is the speaker recognition?
Eagle Speaker Recognition achieves 0.18% EER on VoxConverse, a widely used multi-speaker dataset containing real conversations across multiple languages. That is 2.7x lower than SpeechBrain (0.49%) and 3.9x lower than pyannote (0.70%). EER measures the point where false acceptance and false rejection rates are equal; a lower EER means fewer impostors get through and fewer genuine users get blocked. The admin_similarity_threshold parameter lets you tune the tradeoff between security (higher threshold, fewer false accepts) and convenience (lower threshold, fewer false rejects).
Can someone spoof the voice authentication?
Eagle Speaker Recognition is not an anti-spoofing detector and does not perform liveness detection. Eagle compares the acoustic features of the speaker's voice. It is designed for convenience-level voice biometric authentication (device personalization, role-based access) rather than high-security authentication (financial transactions, facility access). So it does not distinguish between a live speaker and a high-quality recording or synthetic voice. Spoofing techniques, including voice cloning and deepfake audio, continue to advance. For high-security applications, combine voice biometrics with a second factor such as a PIN, badge, or biometric sensor.
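The second-factor pairing suggested above can be sketched as a simple gate: a sensitive action requires both a confident voiceprint match and, for example, a correct PIN. The function and the stored hash below are illustrative placeholders, not part of any Picovoice SDK:

```python
import hashlib
import hmac

# Placeholder PIN hash; a real deployment would store salted hashes securely.
PIN_HASH = hashlib.sha256(b"4921").hexdigest()

def authorize(voice_score, pin, threshold=0.8):
    """Require both a confident voiceprint match and a correct PIN."""
    voice_ok = voice_score >= threshold
    # Constant-time comparison avoids leaking information via timing.
    pin_ok = hmac.compare_digest(hashlib.sha256(pin.encode()).hexdigest(), PIN_HASH)
    return voice_ok and pin_ok

print(authorize(0.92, "4921"))  # True: both factors pass
print(authorize(0.92, "0000"))  # False: wrong PIN, even with a strong voice match
```

A recording or cloned voice that fools the voiceprint check still fails without the second factor.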
How many speakers can be enrolled?
There is no hard limit on the number of enrolled speaker profiles. Each profile is a lightweight file generated from a few seconds of speech. The application passes all profiles to Eagle Speaker Recognition on each recognition call, and Eagle returns similarity scores for each.
Does the speaker recognition work offline?
Yes. Eagle Speaker Recognition runs entirely on-device. Voiceprint enrollment and recognition both happen locally. No audio or voiceprint data is transmitted to Picovoice or any third-party server.
Can I use speaker recognition without the voice command pipeline?
Yes. Eagle Speaker Recognition works as a standalone SDK. You can use it for speaker identification or verification in any application without Porcupine, Rhino, or Orca. This recipe combines all four to demonstrate a complete speaker-aware voice assistant.
Does the voice assistant store or transmit audio?
No. All audio is processed on the device and discarded. Voiceprint profiles are stored locally as .egl files. Nothing is transmitted to Picovoice or any third-party cloud. Picovoice has no data controller relationship with your end users.
How can I get technical support?