Leopard Speech-to-Text
Flutter API

API Reference for the Flutter Leopard SDK (pub.dev)

Leopard

class Leopard { }

Class for the Leopard Speech-to-Text engine. Leopard must be initialized using create(). Resources should be cleaned when you are done using the delete() function.

Leopard.`getAvailableDevices()`

static Future<List<String>> getAvailableDevices() async

Lists all available devices that Leopard can use for inference. Entries in the list can be used as the device argument when initializing Leopard.

Returns

List<String> : A list of devices Leopard can run inference on.

Throws

LeopardException : If unable to get devices.

Leopard.`create()`

static Future<Leopard> create(
    String accessKey, 
    String modelPath,
    {
      String? device,
      enableAutomaticPunctuation = false,
      enableDiarization = false
    })

Static creator for initializing Leopard.

Parameters

accessKey String : AccessKey obtained from Picovoice Console.
modelPath String : Path to the file containing model parameters (.pv). Can be either a path that is relative to the project's assets folder or an absolute path to the file on device.
device String? : (Optional) The string representation of the device (e.g., CPU or GPU) to use. If set to best, the most suitable device is selected automatically. If set to gpu, the engine uses the first available GPU device. To select a specific GPU device, set this argument to gpu:${GPU_INDEX}, where ${GPU_INDEX} is the index of the target GPU. If set to cpu, the engine will run on the CPU with the default number of threads. To specify the number of threads, set this argument to cpu:${NUM_THREADS}, where ${NUM_THREADS} is the desired number of threads.
enableAutomaticPunctuation bool? : (Optional) Set to true to enable automatic punctuation insertion.
enableDiarization bool? : (Optional) Set to true to enable speaker diarization, which allows Leopard to differentiate speakers as part of the transcription process. Word metadata will include a speaker_tag to identify unique speakers.

Returns

Future<Leopard> an instance of the speech-to-text engine.

Throws

LeopardException : If not initialized correctly.

Leopard.`process()`

Future<LeopardTranscript> process(List<int>? frame)

Process a frame of pcm audio with the speech-to-text engine.

Parameters

frame List<int> : a frame of audio samples to be assessed by Leopard. The required audio format is found by calling .sampleRate to get the required sample rate. Audio must be single-channel and 16-bit linearly-encoded.

Returns

Future<LeopardTranscript>: LeopardTranscript object which contains the transcription results of the engine.

Throws

LeopardException : If process fails.

Leopard.`processFile()`

Future<LeopardTranscript> processFile(String path)

Processes a given audio file with the speech-to-text-engine.

Parameters

path String : Absolute path to the audio file. The supported formats are: 3gp (AMR), FLAC, MP3, MP4/m4a (AAC), Ogg, WAV and WebM.

Returns

Future<LeopardTranscript>: LeopardTranscript object which contains the transcription results of the engine.

Throws

LeopardException : If process fails.

Leopard.`delete()`

Future<void> delete()

Frees memory that was allocated for Leopard

Leopard.`sampleRate`

int get sampleRate

Getter for the audio sample rate required by Leopard.

Leopard.`version`

String get version

Getter for Leopard version string.

LeopardException

class LeopardException implements Exception { }

Exception thrown if an error occurs within Leopard:

class LeopardMemoryException extends LeopardException { }
class LeopardIOException extends LeopardException { }
class LeopardInvalidArgumentException extends LeopardException { }
class LeopardStopIterationException extends LeopardException { }
class LeopardKeyException extends LeopardException { }
class LeopardInvalidStateException extends LeopardException { }
class LeopardRuntimeException extends LeopardException { }
class LeopardActivationException extends LeopardException { }
class LeopardActivationLimitException extends LeopardException { }
class LeopardActivationThrottledException extends LeopardException { }
class LeopardActivationRefusedException extends LeopardException { }

LeopardTranscript

class LeopardTranscript

Class that contains results from Leopard's process functions.

LeopardTranscript.`transcript`

String get transcript

Getter for transcript data.

Returns

String: Inferred transcript.

LeopardTranscript.`words`

List<LeopardWord> get words

Getter for word metadata in the form of LeopardWords.

LeopardWord

class LeopardWord

Class that contains word metadata.

LeopardWord.`word`

String get word

Getter for the transcribed word.

LeopardWord.`startSec`

final double startSec

Start time of word in seconds.

LeopardWord.`endSec`

final double endSec

End time of word in seconds.

LeopardWord.`confidence`

final double confidence

Transcription confidence. It is a number within [0, 1].

LeopardWord.`speakerTag`

final int speakerTag

The speaker tag is -1 if diarization is not enabled during initialization; otherwise, it's a non-negative integer identifying unique speakers, with 0 reserved for unknown speakers.

Was this doc helpful?

Issue with this doc?

Leopard Speech-to-Text Flutter API

Leopard Speech-to-Text
Flutter API