How AI call assist is built
On-device Voice AI and Language SDKs in a single pipeline
On-device AI call assist combines Cheetah Streaming Speech-to-Text, Orca Streaming Text-to-Speech, picoLLM Inference, and Rhino Speech-to-Intent in a single local pipeline. Orca greets callers, Cheetah transcribes what they say in real time, picoLLM analyzes what's said and declines the call if it's suspicious. If it's not suspicious, Rhino captures the phone owner's decision — i.e., intent to direct Orca to respond accordingly. Most implementations require the user to select a source language upfront or still route at least one stage through a cloud API. This pipeline eliminates both constraints: every stage runs on the device.