← All repositories

livekitlivekit-wakeword

101
0
View on GitHub
101 stars15 forksPythonApache-2.0Updated 2d ago1view

Tags

Maturity
BetaBetaActively developed but pre-1.0 — no stable release tagged yet.View all repositories tagged
Persona
Backend EngineerBackend EngineerEngineers building server-side services, APIs, and infrastructure components.View all repositories taggedAI Platform EngineerAI Platform EngineerEngineers building infrastructure and tooling for AI agents and LLM-powered applications.View all repositories tagged
License Category
PermissivePermissivePermissive open-source license (MIT, Apache 2.0, BSD) allowing use without copyleft obligations.View all repositories tagged
Built With
ONNX RuntimeONNX RuntimeCross-platform inference engine for running exported wake word classifier ONNX models.View all repositories taggedRustRustSystems programming language used for CubeAPI, CubeHypervisor, and CubeShim components.View all repositories taggedPyTorchPyTorchMachine learning framework used for classifier training and checkpoint management.View all repositories taggedPydanticPydanticUsed for validated YAML configuration models across the training pipeline.View all repositories tagged
Security Posture
UnratedUnratedOSSF Scorecard has not yet scanned this repository — security posture is unknown.View all repositories tagged
Maintainer Model
Single-personSingle-personMaintained primarily by one developer — top contributor accounts for most commits.View all repositories tagged
Form Factor
LibraryLibraryImportable package or module that adds capabilities to an existing application.View all repositories tagged
Platform
MobileMobileRuns on iOS and Android mobile devices.View all repositories taggedCLICLICommand-line interface as the primary interaction surface.View all repositories taggedServerServerRuns on server or bare-metal hardware, not a hosted cloud service.View all repositories taggedDesktopDesktopRuns as a native desktop application on macOS and Linux.View all repositories tagged
Use case
Drop-in replacement for openWakeWordDrop-in replacement for openWakeWordWhen my current openWakeWord deployment has too many false triggers, I want to swap in livekit-wakeword using the same ONNX model format and inference API, so I can reduce FPPH without rewriting my pipeline.View all repositories taggedTrain wake words in 30+ languagesTrain wake words in 30+ languagesWhen I am building a voice assistant for a non-English market, I want to train a wake word model in Chinese, Japanese, or Arabic using VoxCPM2 synthetic speech, so I can ship localized voice activation without building a multilingual dataset from scratch.View all repositories taggedWake word detection in iOS / macOS appsWake word detection in iOS / macOS appsWhen I am building a hands-free iOS or macOS app, I want a Swift wake word library that dispatches inference to the Neural Engine via CoreML, so I can activate features by voice without battery impact or network latency.View all repositories taggedOn-device wake word with low false-positive rateOn-device wake word with low false-positive rateWhen my wake word model triggers too often on background speech, I want a conv-attention classifier that models temporal phoneme ordering, so I can achieve fewer than 0.1 false positives per hour without sacrificing recall.View all repositories taggedTrain a custom wake word from scratchTrain a custom wake word from scratchWhen I want to add a custom voice trigger to my application, I want to train a wake word model from a text phrase using synthetic TTS data, so I can ship a production-quality hotword detector without recording real audio.View all repositories taggedRun wake word training on cloud GPUsRun wake word training on cloud GPUsWhen local GPU resources are insufficient for large-scale wake word training, I want to launch a SkyPilot job that runs the full pipeline on a cloud instance, so I can train production-scale models without owning a GPU server.View all repositories taggedTrigger a LiveKit voice agent on wake wordTrigger a LiveKit voice agent on wake wordWhen a user is in the same room as a LiveKit-connected device, I want the AI agent to wake and join the room automatically when it hears a predefined phrase, so I can build a hands-free voice assistant experience.View all repositories tagged
Ecosystem
PythonPythonBuilt with the Python language.View all repositories tagged
Status
ActiveActiveReceives recent commits and releases, issues are being addressed.View all repositories tagged
Features
Checkpoint averaging for robust final modelCheckpoint averaging for robust final modelAverage weights of top-performing checkpoints to produce a smoother, more generalizable final model.View all repositories taggedAsync microphone listener with debouncingAsync microphone listener with debouncingCapture live microphone audio and emit detection events asynchronously with configurable debounce.View all repositories taggedDET curve evaluation (AUT, FPPH, recall)DET curve evaluation (AUT, FPPH, recall)Evaluate ONNX wake word models with DET curves, AUT, and false-positives-per-hour metrics.View all repositories taggedWake word detectionWake word detectionDetect a specific spoken phrase in a continuous audio stream and return a confidence score.View all repositories taggedONNX export with optional INT8 quantizationONNX export with optional INT8 quantizationExport trained PyTorch classifier to ONNX for cross-platform deployment; supports INT8 dynamic quantization.View all repositories taggedVoice pipelineVoice pipelineSTT/TTS pipeline for real-time voice interaction on macOS, iOS, Android, and Discord voice channels.View all repositories taggedMultilingual TTS synthesis (30+ languages)Multilingual TTS synthesis (30+ languages)Generate wake word training data in 30 languages plus Chinese dialects via VoxCPM2 voice design.View all repositories taggedSynthetic training data generationSynthetic training data generationGenerate positive and adversarial-negative training clips via TTS without recording real audio.View all repositories tagged
License
Apache 2.0Apache 2.0Apache License 2.0 — permissive license with patent grant and attribution requirement.View all repositories tagged

Documentation

8 pages indexed · 1,169 words
  • READMElivekit-wakeword — Wake Word Librarygithub.com/livekit/livekit-wakeword/blob/main/README.md
  • Architecture Overviewgithub.com/livekit/livekit-wakeword/blob/main/docs/overview.md
  • Export & Inference APIgithub.com/livekit/livekit-wakeword/blob/main/docs/export-and-inference.md
  • Training Pipelinegithub.com/livekit/livekit-wakeword/blob/main/docs/training.md
  • Data Generation Pipelinegithub.com/livekit/livekit-wakeword/blob/main/docs/data-generation.md
  • Augmentation Pipelinegithub.com/livekit/livekit-wakeword/blob/main/docs/augmentation.md
  • Feature Extraction Pipelinegithub.com/livekit/livekit-wakeword/blob/main/docs/feature-extraction.md
  • Evaluation — DET curves, AUT, FPPHgithub.com/livekit/livekit-wakeword/blob/main/docs/evaluation.md