Blog

Blog Thumbnail
A Complete Guide to Medical Language Models (Medical LLMs)
January 23, 2026 · 6 min read

Medical Language Models (Medical LLMs or Healthcare LLMs) are AI systems specifically trained on clinical literature, medical records, and healthcare data to understand medical terminology, generate clinical documentation, and assist with diagnostic reasoning.

Blog Thumbnail
Complete Guide to Real-Time Transcription (2026)
January 23, 2026 · 12 min read

Real-time transcription converts speech to text instantly with <1 second latency as someone speaks. It processes audio continuously, enabling use cases such as live captions, meeting transcription, voice assistants, and accessibility features across industries.

Blog Thumbnail
iOS Real-Time Noise Suppression SDK: Swift Tutorial
January 21, 2026 · 3 min read

Integrate real-time noise suppression into your iOS app with the Koala iOS SDK. On-device noise supression for real-time communication apps.

Blog Thumbnail
ML Kit Android Speech-to-Speech Translation: Complete Kotlin Tutorial
January 21, 2026 · 5 min read

Build ML Kit Android speech-to-speech translation with Kotlin. Complete guide using Cheetah STT, Google ML Kit Translation, and Orca TTS for on-device voice translation.

Blog Thumbnail
Build a Real-Time Meeting Summarization Tool in Python
January 19, 2026 · 2 min read

Complete guide to building a real-time meeting summarization tool in Python with streaming speech-to-text and AI summaries. Full code included.

Blog Thumbnail
Build an AI Voice Note-Taking App with Python
January 19, 2026 · 2 min read

Complete guide to building a voice note-taking app in Python with wake word activation, stop phrase control, and on-device transcription. Full code included.

Blog Thumbnail
How to Play Audio in Python: PCM Audio Playback Guide
January 19, 2026 · 4 min read

Learn how to play audio in Python with PvSpeaker. Stream PCM audio output for text-to-speech, audio synthesis, and real-time audio playback on Windows, macOS, and Linux.

Blog Thumbnail
How to Record Audio in React Native for iOS and Android
January 19, 2026 · 3 min read

Learn how to record audio in React Native apps for Android and iOS. Capture PCM microphone input for speech recognition, voice commands, and real-time audio processing.

Blog Thumbnail
Automatic Punctuation and Truecasing with Python Speech-to-Text
January 16, 2026 · 3 min read

Learn how to enable automatic punctuation and correct casing in speech-to-text with Python. Get formatted transcripts with periods, commas, and capitalization.

Blog Thumbnail
Complete Guide to Building HIPAA-Compliant Medical Voice AI Agent
January 16, 2026 · 4 min read

Build HIPAA-compliant medical voice agent in Python with on-device speech processing. Complete tutorial with wake word detection, real-time STT, and TTS.

Blog Thumbnail
Run Local Large Language Models in C: Cross-Platform LLM Inference
December 23, 2025 · 5 min read

Learn how to run LLMs locally in C across Linux, Windows, macOS, and Raspberry Pi with streaming text generation.

Blog Thumbnail
Step-by-Step Guide: Add Speaker Diarization to OpenAI Whisper in C++
December 19, 2025 · 5 min read

Step-by-step guide to adding speaker diarization to OpenAI Whisper STT in C++ using Falcon Speaker Diarization for multi-speaker transcription.

Blog Thumbnail
Complete Tutorial: Voice Activity Detection in C
December 18, 2025 · 3 min read

Voice Activity Detection (VAD) is a core building block for speech and audio systems, used to determine when human speech is present in an audio stream.

Blog Thumbnail
How to Implement Cross-Platform Noise Suppression in C
December 18, 2025 · 4 min read

Learn how to implement real-time noise cancellation in C across Linux, Windows, macOS, and Raspberry Pi.

Blog Thumbnail
Complete MCP Tutorial: How to Build a Local MCP Voice Assistant in Python
December 17, 2025 · 6 min read

Learn how to build a local MCP voice assistant using a local LLM to handle function calling, speech-to-text, text-to-speech, and external API integration in this step-by-step MCP tutorial.

Blog Thumbnail
Build Voice Banking Assistant with Python SDK
December 16, 2025 · 3 min read

Build a banking voice AI agent with custom wake words and voice activated banking features for secure and compliant financial applications.