Android Speech Recognition Service using Vosk/Kaldi and Mozilla DeepSpeech
Go to file
2021-11-22 23:41:46 +04:00
app Remove DeepSpeechService and replace Catalan to Russian 2021-11-22 23:41:46 +04:00
gradle/wrapper Update to latest Vosk version 2021-08-03 21:18:32 +02:00
.gitattributes Working Vosk RecognitionService 2020-11-19 22:49:02 +01:00
.gitignore Update to latest Vosk version 2021-08-03 21:18:32 +02:00
build.gradle Update to latest Vosk version 2021-08-03 21:18:32 +02:00
demo.gif Update README 2020-12-03 14:22:09 +01:00
gradle.properties Update to latest Vosk version 2021-08-03 21:18:32 +02:00
gradlew Working Vosk RecognitionService 2020-11-19 22:49:02 +01:00
gradlew.bat Working Vosk RecognitionService 2020-11-19 22:49:02 +01:00
LICENSE Add LICENSE and NOTICE 2020-12-03 14:02:11 +01:00
NOTICE Add LICENSE and NOTICE 2020-12-03 14:02:11 +01:00
README.md Update README 2020-12-03 14:22:09 +01:00
settings.gradle Working Vosk RecognitionService 2020-11-19 22:49:02 +01:00

LocalSTT

(Jump to english)

[Català]

Nota: Aquesta aplicació de moment només és una prova de concepte

LocalSTT és una aplicació per Android que proporciona reconeixement automàtic de la parla sense necessitat de conexió a internet ja que tot el processament és local al mòbil.

Això és possible gràcies a:

  • un RecognitionService que utilitza la llibreria de Vosk
  • un RecognitionService que utilitza la lliberia de Mozilla Deepspeech
  • una Activity que gestiona intents RECOGNIZE_SPEECH entre altres

El codi és actualment una prova de concepte i es basa fortament en els següents projectes:

LocalSTT hauria de funcionar amb la majoria de teclats i aplicacions que implementen la funció de reconeixement de veu a través d'un intent RECOGNIZE_SPEECH o directament fent servir la classe SpeechRecognizer d'Android. Ha estat provada amb èxit fent servir les següent aplicacions en un terminal Android 9:

Us podeu descarregar un APK que inclou models de Vosk i DeepSpeech pel català aquí.

[English]

Note: This application is just a proof of concept for now

LocalSTT is an Android application that provides automatic speech recognition services without needing internet connection as all processing is done locally on your phone.

This is possible thanks to:

  • a RecognitionService wrapping the Vosk library
  • a RecognitionService wrapping Mozilla's DeepSpeech library
  • an Activity that handles RECOGNIZE_SPEECH intents amongst others

The code is currently just a PoC strongly based on:

LocalSTT should work with all keyboards and applications implementing speech recognition through the RECOGNIZE_SPEECH intent or Android's SpeechRecognizer class. It has been successfully tested using the following applications on Android 9:

You can download a pre-built binary with Vosk and DeepSpeech models for catalan here.

If you want to use the application with your language just replace the models in app/src/main/assets/sync and rebuild the application.

Demo

LocalSTT in action