Voice DVIR, Maintenance and Inspection

Build a hands-free DVIR and inspection app that runs on-device

Voice prompts guide each inspection step, and inspectors respond in natural speech. The app captures asset ID, fluid levels, tire condition, and service status as structured slots, plus free-form defect notes. Everything runs entirely on the device.

Platforms supported
Android · iOS · Linux · macOS · Windows · Chrome · Edge · Firefox · Safari · Raspberry Pi
How on-device voice guided inspection works

Five on-device voice AI SDKs. One hands-free inspection loop.

The on-device voice guided inspection pipeline runs five Picovoice SDKs in a loop on the operator's device: Porcupine Wake Word, Rhino Speech-to-Intent, Cheetah Streaming Speech-to-Text, Orca Text-to-Speech, and Koala Noise Suppression. Porcupine listens for the wake word. Koala suppresses shop, ramp, and engine noise. Orca speaks each inspection prompt aloud. Rhino captures structured slots — asset ID, fluid levels, tire condition, service status — directly from speech. Cheetah transcribes free-form defect notes. Audio never leaves the device.
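The loop described above can be sketched in Python. The stub class below stands in for the real SDKs (`pvporcupine`, `pvrhino`, `pvcheetah`, `pvorca`, `pvkoala`), which consume 16 kHz PCM frames rather than the strings used here; this is a control-flow sketch under those assumptions, not the Picovoice API.

```python
# Control-flow sketch of the five-engine inspection loop.
# Stub engines stand in for the real Picovoice SDKs; real code would
# feed 16 kHz PCM frames to pvporcupine, pvrhino, pvcheetah, pvorca, pvkoala.

class StubEngines:
    """Pretends to be the five engines; returns canned results."""
    def denoise(self, frame):               # Koala
        return frame
    def heard_wake_word(self, frame):       # Porcupine
        return frame == "start inspection"
    def speak(self, text):                  # Orca
        print(f"PROMPT: {text}")
    def infer_slots(self, frame):           # Rhino
        return {"oil_level": "full"} if "oil" in frame else {"tire_condition": "worn"}
    def transcribe(self, frame):            # Cheetah
        return frame

def run_inspection(engines, audio_frames, prompts):
    record = {"defect_notes": []}
    awake = False
    step = 0
    for frame in audio_frames:
        frame = engines.denoise(frame)      # every frame is cleaned first
        if not awake:
            if engines.heard_wake_word(frame):
                awake = True
                engines.speak(prompts[step])
            continue
        if step < len(prompts):
            record.update(engines.infer_slots(frame))   # structured slot
            step += 1
            if step < len(prompts):
                engines.speak(prompts[step])
            else:
                engines.speak("Any defects to note?")
        else:
            record["defect_notes"].append(engines.transcribe(frame))  # free-form
    return record

record = run_inspection(
    StubEngines(),
    ["start inspection", "oil is full", "front tires worn", "left mirror cracked"],
    ["What's the oil level?", "What's the tire condition?"],
)
print(record)
```

The key property the sketch illustrates: noise suppression runs on every frame, the wake word gates everything else, and each operator utterance lands either in a typed slot or in the free-form notes.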

Pipeline diagram: Microphone (operator audio) → Koala noise suppression → Porcupine wake word ("Start inspection") → Orca text-to-speech prompt ("What's the oil level?") → operator speaks the response → Rhino speech-to-intent (structured slots) and Cheetah streaming STT (free-form notes) → structured inspection record (asset ID + slot values + defect notes, DOT DVIR ready) → next prompt in the loop.
Why Koala Noise Suppression?

2× more effective at shop and ramp noise. Same footprint.

17.3×
More effective than RNNoise at 0 dB SNR
5.4×
More effective than RNNoise at 5 dB SNR
4.3×
More effective than RNNoise on average

Shop bays, depots, and roadside walkarounds are loud. Koala Noise Suppression removes the noise of air tools, idling engines, traffic, and generators before the audio reaches Porcupine, Rhino, and Cheetah. Koala suppresses background noise twice as effectively as RNNoise at the same compute footprint, which leaves headroom for the rest of the pipeline on embedded devices, legacy phones, and rugged handhelds from Honeywell or Zebra.

STOI Distance to Clean Speech at 0 dB (lower is better)
Original: 0.232
RNNoise: 0.226
Koala: 0.128

STOI Distance to Clean Speech at 5 dB (lower is better)
Original: 0.156
RNNoise: 0.142
Koala: 0.080
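The headline multipliers follow from the STOI distances, assuming "more effective" means how much of the distance to clean speech each engine removes relative to the unprocessed audio (that reading is an inference from the published numbers, not a stated definition):

```python
# "More effective" read as: reduction in STOI distance to clean speech,
# relative to the unprocessed (Original) audio.
def effectiveness_ratio(original, engine_a, engine_b):
    """How many times more distance engine_a removes than engine_b."""
    return (original - engine_a) / (original - engine_b)

# Published STOI distances (lower is better).
r0 = effectiveness_ratio(0.232, engine_a=0.128, engine_b=0.226)  # 0 dB SNR
r5 = effectiveness_ratio(0.156, engine_a=0.080, engine_b=0.142)  # 5 dB SNR

print(round(r0, 1), round(r5, 1))  # matches the 17.3x and 5.4x claims
```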
Why Porcupine Wake Word?

Always-on inspection trigger at low CPU and battery cost.

3.8%
Single-Core CPU Utilization on Raspberry Pi 3
97.1%
Accuracy at 1 false alarm per 10 hours
~250K
Custom wake words trained and deployed in 2025

Porcupine Wake Word starts the inspection workflow when the operator says the chosen phrase. Drivers can train a branded wake word or always-listening commands, such as “Start inspection” or “Begin walkaround”, in the Picovoice Console and deploy them across mobile and embedded devices. Porcupine runs always-on at low CPU and battery cost, so the rest of the pipeline spins up only when needed.

Wake Word Detection Accuracy (higher is better)
Porcupine: 97.1%
Snowboy: 68%
PocketSphinx: 52%

CPU Utilization (lower is better)
Porcupine: 3.8%
Snowboy: 24.8%
PocketSphinx: 31.8%
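The gating pattern can be sketched as follows. The stub mimics only the shape of the real detector (in `pvporcupine`, `process()` returns the index of the detected keyword, or -1 for no match); the string frames are invented for illustration.

```python
# Wake-word gating: run the cheap detector on every frame and start the
# expensive pipeline only on a hit. The stub mimics the shape of the real
# pvporcupine API: process(frame) -> keyword index, or -1 for no match.

class StubWakeWordDetector:
    def __init__(self, keywords):
        self.keywords = keywords
    def process(self, frame):
        for i, kw in enumerate(self.keywords):
            if kw in frame:
                return i
        return -1

def gate(frames, detector):
    """Count frames the heavy pipeline would actually see."""
    started = False
    heavy_frames = 0
    for frame in frames:
        if not started:
            started = detector.process(frame) >= 0
            continue
        heavy_frames += 1  # downstream engines run only after the trigger
    return heavy_frames

detector = StubWakeWordDetector(["start inspection", "begin walkaround"])
frames = ["engine idling", "radio chatter", "start inspection", "oil is full", "tires good"]
print(gate(frames, detector))  # only the 2 post-trigger frames reach the pipeline
```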
Why Rhino Speech-to-Intent?

Structured DVIR slots directly from speech.

6×
Higher accuracy than Big Tech average
97.3%
Accuracy tested across 6 to 24 dB Signal-to-Noise Ratio
Unlimited voice interactions per user

Rhino Speech-to-Intent captures structured DVIR slots — unit ID, oil condition, tire condition, service status — directly from speech. Most voice command systems run a two-step pipeline: speech-to-text produces a transcript, then a separate NLU model parses that transcript for intent. Each step accumulates error and compounds latency. Rhino infers intent and typed slot values directly from audio, which holds higher accuracy even in noisy environments without hallucinations or compounding errors.

Voice Command Acceptance Accuracy (higher is better)
Rhino: 97.3%
Amazon Lex: 84.3%
Google Dialogflow: 77.3%

Voice Command Acceptance Accuracy at 21 dB SNR (higher is better)
Rhino: 99%
Amazon Lex: 87%
Google Dialogflow: 83%
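A Rhino inference arrives as an intent name plus typed slot values. A minimal sketch of folding a sequence of such inferences into one DVIR record, assuming hypothetical intent and slot names (the real names come from the context YAML you train):

```python
# Fold Rhino-style inferences (intent + typed slots) into one DVIR record.
# Intent and slot names here are illustrative; the real ones are defined
# in the Rhino context YAML trained in Picovoice Console.

def fold_inferences(inferences):
    record = {}
    for inf in inferences:
        if not inf["is_understood"]:
            continue  # a real app would re-prompt the operator here
        record.update(inf["slots"])
    return record

inferences = [
    {"is_understood": True, "intent": "setUnit", "slots": {"unit_id": "truck 12"}},
    {"is_understood": True, "intent": "reportOil", "slots": {"oil": "low"}},
    {"is_understood": False, "intent": None, "slots": {}},  # noise / unclear speech
    {"is_understood": True, "intent": "reportTires", "slots": {"tires": "good"}},
]
record = fold_inferences(inferences)
print(record)
```

Because the slots arrive already typed, there is no transcript to re-parse; a not-understood utterance simply yields no slots rather than a corrupted field.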
Why Cheetah Streaming Speech-to-Text?

Free-form defect notes transcribed in real time.

10.1%
WER (English) vs. 11.9% Google and 10.6% Moonshine Medium
0.08
CPU Core-Hours vs. 3.36 for Moonshine Medium, about 40× less
8.6%
WER (Spanish) vs. 11.6% Google and 9.4% Azure

Cheetah Streaming Speech-to-Text transcribes free-form defect notes in real time. Per the open-source real-time transcription benchmark, Cheetah beats Google Cloud STT on word error rate and word emission latency across all tested languages, and outperforms Azure STT on several. It emits words at 590 ms median latency, typically one word behind the speaker, and requires less compute than any other local engine tested. Cheetah accepts custom vocabulary for fleet jargon and OEM part numbers, which raises accuracy further on the actual words inspectors say.

English Word Error Rate (lower is better)
Amazon Streaming: 5.6%
Azure Real-time: 8.2%
Cheetah Streaming: 10.1%
Moonshine Streaming Medium: 10.6%
Vosk Streaming Large: 11.5%
Google Streaming: 11.9%
Whisper.cpp Streaming Base: 19.8%

English Punctuation Error Rate (lower is better)
Cheetah Streaming: 16.1%
Azure Real-time: 16.4%
Amazon Streaming: 24.4%
Google Streaming: 36%
Moonshine Streaming Medium: 45.1%
Whisper.cpp Streaming Base: 54.1%
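Streaming STT emits partial transcripts as audio arrives. The stub below mimics the shape of `pvcheetah`, where `process()` returns a partial transcript plus an endpoint flag and `flush()` returns the remainder; the scripted partials are invented for illustration.

```python
# Accumulate a free-form defect note from streaming partial transcripts.
# The stub mimics the shape of pvcheetah: process(frame) returns
# (partial_transcript, is_endpoint); flush() returns whatever remains.

class StubStreamingSTT:
    def __init__(self, scripted, remainder):
        self.scripted = scripted      # list of (partial, is_endpoint) pairs
        self.remainder = remainder
        self.i = 0
    def process(self, frame):
        out = self.scripted[self.i]
        self.i += 1
        return out
    def flush(self):
        return self.remainder

def take_note(stt, frames):
    note = ""
    for frame in frames:
        partial, is_endpoint = stt.process(frame)
        note += partial
        if is_endpoint:          # speaker paused: close out the note
            note += stt.flush()
            break
    return note.strip()

stt = StubStreamingSTT(
    [("left ", False), ("mirror ", False), ("cracked,", True)],
    " needs replacement",
)
note = take_note(stt, ["frame1", "frame2", "frame3"])
print(note)
```

In the real engine the endpoint flag fires after a configurable silence, which is what lets the app hand control back to the next Orca prompt without a button press.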
Why Orca Text-to-Speech?

Natural-sounding TTS at 29 MB peak memory.

29 MB
Peak Memory Usage
130 ms
First-token-to-speech latency
7 MB
Model Size

Orca Streaming Text-to-Speech reads each inspection prompt aloud — “What's the oil level?”, “What's the tire condition?” — so the operator never has to look at the screen. Most high-quality TTS engines require hundreds of megabytes of RAM. Orca uses 29 MB peak memory, 10 to 50 times less than any natural-sounding on-device alternative, which leaves enough headroom to run all five engines on a single rugged tablet without OOM crashes.

TTS Latency (lower is better)
Orca TTS Streaming: 128 ms
ElevenLabs TTS Streaming: 335 ms
eSpeak TTS: 1,430 ms
ElevenLabs TTS: 1,470 ms

Audio Quality
Listen and compare, grouped by peak memory usage.
Peak Memory Usage < 30 MB: eSpeak, Orca
On-device voice guided inspection use cases

From DVIR walkarounds to plant maintenance

Fleet & DOT DVIR

Hands-free DOT DVIR for fleet drivers and mechanics

A Driver Vehicle Inspection Report (DVIR) is a daily record that the U.S. Department of Transportation (DOT) requires for commercial motor vehicles. Voice guided inspection apps turn pre-trip and post-trip DVIR walkarounds into spoken workflows that capture assets, conditions, and status as structured slots, ready to push to Whip Around, Fleetio, Samsara, or Verizon Connect.

Construction & mining

On-device equipment inspection for construction, mining, and agriculture

Heavy equipment on construction sites, in mines, quarries, and large farms needs daily inspection in places with no signal. The on-device pipeline runs on a commodity Android phone or a rugged tablet from Honeywell or Zebra and captures structured condition data and free-form defect notes with no connectivity and no proprietary inspection hardware required.

Utilities, HVAC & manufacturing

Voice guided facility inspection for utilities, HVAC, and manufacturing

Plant inspectors, utilities crews, and HVAC technicians walk past boilers, pumps, motors, and panels that need a quick condition check. The on-device voice guided inspection pipeline prompts each asset, captures the reading and the inspector's free-form note, and works offline in basements, mechanical rooms, and remote substations where Wi-Fi doesn't reach.

Aviation & marine

Voice guided pre-departure checklists for aviation and marine

Aviation pre-flight checks and marine vessel pre-departure inspections are read-aloud, respond-aloud workflows by nature. The voice guided pipeline matches that workflow and runs offline, critical at smaller airfields, offshore platforms, and harbors where connectivity is unreliable. The same structured slot capture handles airframe, engine, and safety checks.

Get started

On-device voice DVIR and inspection Python code example

A complete working recipe in Python. Open-source on GitHub. Runs 100% on-device.

recipe · voice-dvir-and-inspection
Difficulty
Beginner
Runtime
100% on-device
Language
Python
Platforms supported
Android · iOS · Linux · macOS · Windows · Chrome · Edge · Firefox · Safari · Raspberry Pi

Prerequisites

A Picovoice AccessKey from the Picovoice Console and a local clone of the GitHub repo.

Usage

These instructions assume your current working directory is recipes/voice-guided-maintenance-and-inspection/python.
1. Create a virtual environment

Isolate the recipe's dependencies from your system Python.
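For example, on Linux, macOS, or Raspberry Pi (a sketch; assumes `python3` is on PATH):

```shell
python3 -m venv .venv
```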
2. Activate the virtual environment

Activation makes pip install into .venv instead of the system Python.
Linux, macOS, or Raspberry Pi
Windows
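A sketch of activation; the creation command from step 1 is repeated here so the snippet runs standalone:

```shell
python3 -m venv .venv

# Linux, macOS, or Raspberry Pi:
. .venv/bin/activate

# Windows (PowerShell) equivalent, for reference:
#   .venv\Scripts\Activate.ps1

python -c "import sys; print(sys.prefix)"  # now points inside .venv
```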
3. Install dependencies

Pulls in the Porcupine, Rhino, Cheetah, Orca, and Koala Python SDKs along with PvRecorder and PvSpeaker.
4. Train a wake word

Go to the Picovoice Console, train any phrase, such as “Hey Siri”, “Hey Assistant”, or your brand name, and download the .ppn file for your target platform.
5. Train the Speech-to-Intent model

Open the Picovoice Console, go to Rhino Speech-to-Intent, create an empty context, and import the Rhino context YAML for this recipe. Download the generated .rhn file for your target platform.
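A context YAML for a DVIR-style checklist might look like the sketch below. The intent names, the `condition` slot, and the phrasings are illustrative inventions; the `pv.TwoDigitInteger` built-in and the overall `expressions`/`slots` layout follow Rhino's documented syntax, which should be verified in the Console.

```yaml
context:
  expressions:
    setUnit:
      - "unit number $pv.TwoDigitInteger:unit"
    reportOil:
      - "oil is $condition:oil"
      - "oil level is $condition:oil"
    reportTires:
      - "tires are $condition:tires"
  slots:
    condition:
      - "good"
      - "low"
      - "worn"
      - "needs service"
```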
6. Run the DVIR, maintenance and inspection demo

Pass your AccessKey and the paths to the .ppn and .rhn files.
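The exact script name and flag names belong to the recipe, so the sketch below is kept as comments; every name in it is an assumption to be checked against the recipe's README:

```shell
# Hypothetical invocation; script name and flags are assumptions:
#
#   python main.py \
#     --access_key "${PICOVOICE_ACCESS_KEY}" \
#     --keyword_path path/to/wake_word.ppn \
#     --context_path path/to/context.rhn
```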
Have questions or looking for implementations in other languages? Visit the GitHub pico-cookbook voice guided maintenance and inspection recipe, where you can find the open-source demo code and create an issue for demo-related technical questions.
Frequently asked questions

FAQ

What is voice guided maintenance and inspection?
Voice guided maintenance and inspection is a hands-free DVIR and equipment-inspection workflow where a wake word activates the app, the app asks each inspection step out loud, and the technician answers in natural speech. The app captures both structured slots, such as asset ID, oil condition, tire condition, or service status, and free-form notes, all without typing on a phone or tablet during a walkaround.
How is this different from DVIR apps like Whip Around, Fleetio, or Verizon Connect?
Existing DVIR apps are tap-and-photo workflows on a phone screen. Picovoice's voice-guided pipeline turns the same DVIR or maintenance checklist into a hands-free, eyes-free flow — drivers and technicians keep their hands on the equipment while the app prompts each step. The pipeline runs entirely on-device, so it works in shop bays, depots, and remote sites without connectivity.
Does voice guided inspection work without an internet connection?
Yes. Porcupine Wake Word, Rhino Speech-to-Intent, Cheetah Streaming Speech-to-Text, Orca Streaming Text-to-Speech, and Koala Noise Suppression all run locally. The full inspection workflow works offline — useful in shop bays with weak Wi-Fi, off-site depots, mining and construction sites, and rural transport routes.
Can the captured report meet DOT DVIR compliance requirements?
Yes. The recipe captures structured slots (asset/unit ID, fluid condition, tire condition, service status) plus free-form notes. The output can be mapped to DOT-compliant DVIR formats and pushed to your existing fleet maintenance software (Fleetio, Whip Around, Samsara, Verizon Connect, AssetWorks, etc.) through their APIs. Voice replaces the typing, not the system of record.
What happens in a noisy shop bay or roadside walkaround?
Koala Noise Suppression removes background noise such as air tools, idling engines, traffic, and generators before the audio reaches Cheetah Streaming Speech-to-Text and Rhino Speech-to-Intent. Rhino Speech-to-Intent is end-to-end, with intent accuracy that holds up in noise where transcript-based pipelines collapse. The same pipeline works in a quiet shop and on a busy ramp.
How does the wake word work?
Porcupine Wake Word listens continuously on-device with very low CPU and battery usage, and only triggers the rest of the pipeline when the user speaks the chosen wake phrase. The wake phrase is fully customizable: you choose what your drivers and technicians say, in any supported language.
Is operator audio sent to a third-party cloud?
No. Audio is processed locally on the device. Picovoice cannot access end-user audio. This removes processing-agreement and breach-surface concerns, which is important for fleets in regulated industries (food & pharma logistics, hazmat, defense logistics) and for fleets in jurisdictions with worker-voice rules.
Can I customize the inspection slots and prompts for my equipment?
Yes. The Rhino context YAML defines the unit IDs, the inspection slots (oil, tires, service status — and any others you add), and the accepted phrasings. The Orca Text-to-Speech prompts the operator hears are fully configurable text. Cheetah Streaming Speech-to-Text accepts custom vocabulary for part numbers and equipment names.
What hardware does the on-device voice guided inspection pipeline run on?
The full five-engine pipeline runs on commodity Android phones, iOS devices, rugged tablets from Honeywell and Zebra, and Linux-based fleet hardware. It also runs on Raspberry Pi for embedded telematics installs. No GPU, no NPU, and no dedicated voice hardware required.
Which assets beyond trucks fit this workflow?
Trucks, trailers, forklifts and warehouse equipment, construction and mining equipment, agriculture equipment, aviation pre-flight checks, marine vessels, and stationary plant equipment. Any asset class with a structured inspection checklist and optional free-form notes maps onto the same Porcupine Wake Word, Orca Text-to-Speech, Rhino Speech-to-Intent, Cheetah Streaming Speech-to-Text, and Koala Noise Suppression pipeline.
How can I get technical support for the voice guided maintenance and inspection demo?
Visit the GitHub pico-cookbook voice guided maintenance and inspection recipe where you can find the open-source demo code and create an issue for the demo-related technical questions or reach out to your Picovoice contact.