Best Open Source Python Speech Software for Linux

Browse free, open source Python speech software projects for Linux below.

1

DeepSpeech

Open source embedded speech-to-text engine

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine that can run in real time on devices ranging from a Raspberry Pi 4 to high-power GPU servers. It uses a model trained by machine learning techniques based on Baidu's Deep Speech research paper, and the implementation is built on Google's TensorFlow. A pre-trained English model, along with other important inference material, can be downloaded from the DeepSpeech releases page by following the instructions in the usage docs.
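A minimal sketch of running the pre-trained model from Python, assuming the deepspeech pip package (0.9.x API) and NumPy are installed. The function name transcribe and the file names in the usage note are placeholders chosen for illustration, not something this listing specifies.

```python
import wave


def transcribe(model_path, wav_path, scorer_path=None):
    """Return the transcript of a mono, 16-bit WAV file.

    Third-party imports are done lazily so the module can be loaded
    even where DeepSpeech is not installed.
    """
    import numpy as np              # third-party: pip install numpy
    from deepspeech import Model    # third-party: pip install deepspeech

    model = Model(model_path)
    if scorer_path:
        # The external scorer (language model) is optional.
        model.enableExternalScorer(scorer_path)

    with wave.open(wav_path, "rb") as wav:
        # The model expects mono 16-bit PCM at its own sample rate.
        if (wav.getnchannels() != 1
                or wav.getsampwidth() != 2
                or wav.getframerate() != model.sampleRate()):
            raise ValueError("expected mono 16-bit audio at the model sample rate")
        frames = wav.readframes(wav.getnframes())

    audio = np.frombuffer(frames, dtype=np.int16)
    return model.stt(audio)
```

A call would look like transcribe("model.pbmm", "audio.wav", "model.scorer"), with the paths replaced by the files downloaded from the releases page.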

Downloads: 25 This Week

Last Update: 2021-04-08
2

SpeechRecognition

Speech recognition module for Python

A library for performing speech recognition, with support for several engines and APIs, online and offline. It can recognize speech input from the microphone, transcribe an audio file, save audio data to an audio file, show extended recognition results, calibrate the recognizer energy threshold for ambient noise levels (see recognizer_instance.energy_threshold for details), and listen to a microphone in the background, among other recognizer features. The easiest way to install it is with pip install SpeechRecognition. The library requires Python 2.6, 2.7, or Python 3.3+. PyAudio is required if and only if you want to use microphone input (Microphone); version 0.2.11+ is needed, as earlier versions have known memory management bugs when recording from microphones in certain situations. To hack on this library, first make sure you have all the requirements listed in the "Requirements" section.
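The file and microphone features described above might be used roughly as follows. This is a sketch, assuming the SpeechRecognition package is installed (plus PyAudio for the microphone case); the function names transcribe_file and listen_once are hypothetical, and recognize_google is only one of the supported engines and needs network access.

```python
def transcribe_file(path, language="en-US"):
    """Transcribe a WAV/AIFF/FLAC file; return None if nothing was understood."""
    import speech_recognition as sr  # third-party: pip install SpeechRecognition

    recognizer = sr.Recognizer()
    with sr.AudioFile(path) as source:
        audio = recognizer.record(source)  # read the whole file
    try:
        return recognizer.recognize_google(audio, language=language)
    except sr.UnknownValueError:
        return None  # speech was unintelligible


def listen_once(calibration_seconds=1.0):
    """Record one phrase from the default microphone and transcribe it."""
    import speech_recognition as sr  # Microphone also requires PyAudio

    recognizer = sr.Recognizer()
    with sr.Microphone() as source:
        # Sets recognizer.energy_threshold from the ambient noise level.
        recognizer.adjust_for_ambient_noise(source, duration=calibration_seconds)
        audio = recognizer.listen(source)
    try:
        return recognizer.recognize_google(audio)
    except sr.UnknownValueError:
        return None
```

Both helpers let sr.RequestError propagate, so a caller can tell "no speech recognized" (None) apart from a failed API request.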

Downloads: 11 This Week

Last Update: 2025-11-19
3

VoiceCode Programming by Voice Toolbox

VoiceCode is an open source initiative started by the National Research Council of Canada to develop a programming-by-voice toolbox. The aim of the project is to make programming through voice input as easy and productive as with mouse and keyboard. To install, use Subversion, as described on this page: http://sourceforge.net/apps/mediawiki/voicecode/index.php?title=VCode_1_Doc/InstallationManual

5 Reviews

Downloads: 3 This Week

Last Update: 2013-03-10
4

FM2TXT

Listens to radio with RTL-SDR, recognizes audio, and writes a text file log

Log speech from your favorite FM station to a text file using an rtl-sdr dongle and speech recognition. Cross-platform tool. Follow the README on the download page for Windows installation: https://sourceforge.net/projects/fm2txt-rtlsdr/files/ If you prefer the GitHub source to SourceForge: https://github.com/randaller/fm2txt For those who want to recognize from a sound card rather than from rtl-sdr (this allows transcribing NFM etc.): https://github.com/randaller/souncard2txt

Downloads: 5 This Week

Last Update: 2017-12-17
5

Audio Trigger

Performs actions on detected volume threshold. Examples:
- Launch music on clap
- Launch speech recording when you start speaking
- Launch guard webcam when a significant sound is detected
- Increase or decrease headphones volume when ambient noise pass
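The volume-threshold idea can be sketched in a few lines. This is not Audio Trigger's own code: the names rms and watch are hypothetical, the input is assumed to be chunks of signed PCM samples, and the action is any callable.

```python
import math
from typing import Callable, Iterable, Sequence


def rms(samples: Sequence[int]) -> float:
    """Root-mean-square amplitude of one chunk of PCM samples."""
    if not samples:
        return 0.0
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def watch(chunks: Iterable[Sequence[int]], threshold: float,
          on_trigger: Callable[[float], None]) -> int:
    """Fire on_trigger each time the level rises from below to at/above threshold.

    Returns how many times the action fired.
    """
    fired = 0
    above = False  # remembered so a sustained loud sound fires only once
    for chunk in chunks:
        level = rms(chunk)
        if level >= threshold and not above:
            on_trigger(level)
            fired += 1
        above = level >= threshold
    return fired


if __name__ == "__main__":
    quiet = [10, -12, 8, -9]
    clap = [9000, -8500, 9100, -8800]
    watch([quiet, clap, clap, quiet, clap], threshold=1000,
          on_trigger=lambda level: print(f"triggered at level {level:.0f}"))
```

Triggering only on the rising edge is a design choice of this sketch; it keeps one long noise from launching the same action repeatedly.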

Downloads: 1 This Week

Last Update: 2013-04-01
6

QWave

QWave: Qt-based waveform display and audio playback class library.

Downloads: 1 This Week

Last Update: 2013-05-01
7

Voice keyboard

Voice keyboard/dictation. Aims to be a total substitute for a keyboard. Spell out words letter by letter using code words (alpha, bravo, ...). Arrow keys and modifiers work. Whole words can be spoken, but whole-word accuracy is not good. Attach commands to some word
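Letter-by-letter spelling with code words can be illustrated as below. This is a sketch rather than the project's implementation: it assumes the recognizer returns lowercase-able words separated by spaces, uses the common NATO-style word list beyond the "alpha, bravo" named above, and the function name spell is hypothetical.

```python
# One code word per letter; each word starts with the letter it stands for.
CODE_WORDS = (
    "alpha bravo charlie delta echo foxtrot golf hotel india juliett "
    "kilo lima mike november oscar papa quebec romeo sierra tango "
    "uniform victor whiskey xray yankee zulu"
).split()

LETTERS = {word: word[0] for word in CODE_WORDS}


def spell(recognized: str) -> str:
    """Convert recognized code words to letters, skipping unknown words."""
    words = recognized.lower().split()
    return "".join(LETTERS[w] for w in words if w in LETTERS)


if __name__ == "__main__":
    print(spell("lima india november uniform xray"))  # prints "linux"
```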

Downloads: 1 This Week

Last Update: 2015-04-20
8

A.L.V.I. Bot

A.L.V.I. was created to be a simple but modular bot, able to interact with humans through natural language and perform a variety of tasks, such as reading mail, news, and feeds aloud. All in Italian!

Downloads: 0 This Week

Last Update: 2014-12-21
9

AIChatbot

An extensible (by plugin) chatbot project

Downloads: 0 This Week

Last Update: 2015-07-02
10

ASR-Builder

ASR-Builder provides an easy-to-use interface to the HTK toolkit, that allows users to build ASR systems. ASR-Builder provides a platform that performs house-keeping tasks when using HTK and also provides default training/testing/recognition scripts.

Downloads: 0 This Week

Last Update: 2013-04-26
11

ASTA - Auto. Subtitle Timing Annotator

A collection of scripts and programs to automatically annotate video/audio for subtitles. It relies on a MARSYAS (Music Analysis, Retrieval and Synthesis for Audio Signals) plug-in to detect human voice in polyphonic recordings.

Downloads: 0 This Week

Last Update: 2014-04-24
12

AarTon

AarTon is an automated text-to-speech application. It lets users enter text in a web-based front end and renders that text via a multi-channel sound card.

Downloads: 0 This Week

Last Update: 2013-11-14
13

Annotation Graph Toolkit

AGTK is a suite of software components for building tools for annotating linguistic signals, time-series data which documents any kind of linguistic behavior (e.g. audio, video). The internal data structures are based on annotation graphs.

Downloads: 0 This Week

Last Update: 2013-04-25
14

Book2m4b

This is a Linux project that acts as a front end to cdparanoia, sox, and ffmpeg, with the aim of making it simple to rip many audiobook CDs into a single mono audiobook (m4b) file for use in audio players capable of playing audiobooks.

Downloads: 0 This Week

Last Update: 2019-03-16
15

Eve

Eve is an AI project written in Python that takes commands verbally or textually to control the computer and everyday functions.

Downloads: 0 This Week

Last Update: 2013-04-03
16

InproTK

An Incremental Spoken Dialogue Processing Toolkit

InproTK is an Incremental Spoken Dialogue Processing Toolkit, that is, a toolkit to help you build dialogue systems that listen and talk incrementally, allowing for advanced interactional behaviour. Please see our Wiki for more information: http://sourceforge.net/p/inprotk/wiki/

Downloads: 0 This Week

Last Update: 2015-06-16
17

Moshi

A speech-text foundation model for real time dialogue

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec. Mimi compresses 24 kHz audio down to a 12.5 Hz representation with a bandwidth of 1.1 kbps, in a fully streaming manner (latency of 80 ms, the frame size), yet performs better than existing non-streaming codecs such as SpeechTokenizer (50 Hz, 4 kbps) or SemantiCodec (50 Hz, 1.3 kbps). Moshi models two streams of audio: one corresponds to Moshi, and the other to the user. At inference, the user's stream is taken from the audio input, and Moshi's stream is sampled from the model's output. Alongside these two audio streams, Moshi predicts text tokens corresponding to its own speech, its inner monologue, which greatly improves the quality of its generation. A small Depth Transformer models inter-codebook dependencies for a given time step, while a large, 7B-parameter Temporal Transformer models the temporal dependencies.
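The codec figures quoted above are mutually consistent, which a short calculation makes visible. The sketch uses only the numbers from the description; the bits-per-frame values are derived here by dividing bitrate by frame rate and are not stated in the listing.

```python
SAMPLE_RATE_HZ = 24_000   # Mimi input audio
FRAME_RATE_HZ = 12.5      # Mimi latent representation
BITRATE_BPS = 1_100       # 1.1 kbps


def bits_per_frame(bitrate_bps: float, frame_rate_hz: float) -> float:
    """Bits available to encode one latent frame."""
    return bitrate_bps / frame_rate_hz


# Audio samples covered by one latent frame.
samples_per_frame = SAMPLE_RATE_HZ / FRAME_RATE_HZ

# Duration of one frame, which is also the streaming latency.
frame_duration_ms = 1000 / FRAME_RATE_HZ

if __name__ == "__main__":
    print(samples_per_frame)                      # 1920.0
    print(frame_duration_ms)                      # 80.0
    print(bits_per_frame(BITRATE_BPS, FRAME_RATE_HZ))  # 88.0
    print(bits_per_frame(4_000, 50))              # SpeechTokenizer: 80.0
    print(bits_per_frame(1_300, 50))              # SemantiCodec: 26.0
```

The 80 ms frame duration matches the stated latency, since one frame must be collected before it can be encoded.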

Downloads: 0 This Week

Last Update: 2024-11-05
18

Open Interface for Speech Synthesis

The Open Interface for Speech Synthesis (OISS) provides an interface to speech synthesis hardware and software for end-user applications under Unix.

Downloads: 0 This Week

Last Update: 2013-02-21
19

PhoneBlogger

PhoneBlogger allows you to post to a weblog by phone. PhoneBlogger is written in VoiceXML, Python, and JavaScript.

Downloads: 0 This Week

Last Update: 2016-08-20
20

Python Gutenberg E-text Project

The PyGE (Python Gutenberg E-text) project is a suite of GUI desktop utilities written in Python to promote and facilitate awareness and enjoyment of works of literature that are available from the archives of Project Gutenberg.

Downloads: 0 This Week

Last Update: 2013-03-22
21

RNNLIB

RNNLIB is a recurrent neural network library for sequence learning problems. Applicable to most types of spatiotemporal data, it has proven particularly effective for speech and handwriting recognition. Full installation and usage instructions are given at http://sourceforge.net/p/rnnl/wiki/Home/

2 Reviews

Downloads: 0 This Week

Last Update: 2016-11-28
22

SWIPE' pitch extractor

This is a fast C implementation of Arturo Camacho's SWIPE' pitch extraction algorithm. See the project homepage for more about the advantages of the SWIPE' algorithm. swipe-1.0.tar.gz contains the current source, which should compile quite neatly.

Downloads: 0 This Week

Last Update: 2013-04-11
23

Skimpy PNG/ASCII/WAVE tools

A collection of tools for generating audio and visual output (PNG/HTML/WAVE) for use in web sites, including CAPTCHA challenges and PNG image creation tools with JavaScript mouse tracking support.

Downloads: 0 This Week

Last Update: 2013-04-01
24

SoccerPhone

SoccerPhone provides live soccer scores by phone. The only league currently supported is US Major League Soccer. Support for Soccernet is under development. SoccerPhone is written in VoiceXML, Python, and JavaScript.

1 Review

Downloads: 0 This Week

Last Update: 2013-02-25
25

Speaking gmail checker

Python + espeak gmail checker

Script that periodically checks your gmail account and speaks if you have unread email.

Features:
- logs into, and checks your gmail accounts
- if you have unread email:
  - textual info about it appears in console/terminal
  - "you have X unread email" is spoken using espeak
- if you have no unread email:
  - textual info about it appears in console/terminal

Requirements:
- POSIX OS
- python: import imaplib import os import time
- espeak
- soundcard, speakers/headphones

Installation/running:
- modify gmail.py inserting your username and password
- modify gmail.py inserting desired delay value in milliseconds
- run: python gmail.py

Author: pkrawczak@gmail.com 2013-02-06 13:54:51

Caution: Even though this program uses SSL use only at your own risk.
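The check-and-speak loop described above could look roughly like this. It is a sketch under assumptions, not the contents of gmail.py: the function names are hypothetical, the host imap.gmail.com and the INBOX folder are assumed, and espeak is assumed to be on the PATH. The script itself lists os among its imports; subprocess is used here instead as a substitution.

```python
import imaplib
import subprocess
import time


def count_unread(user: str, password: str, host: str = "imap.gmail.com") -> int:
    """Return the number of unseen messages in INBOX, over IMAP with SSL."""
    with imaplib.IMAP4_SSL(host) as conn:
        conn.login(user, password)
        conn.select("INBOX", readonly=True)  # read-only: do not mark as seen
        _status, data = conn.search(None, "UNSEEN")
        return len(data[0].split())


def announcement(unread: int) -> str:
    """Text shown in the terminal (and spoken when there is unread mail)."""
    if unread:
        return f"you have {unread} unread email"
    return "no unread email"


def check_forever(user: str, password: str, delay_ms: int = 60_000) -> None:
    """Poll the mailbox, print the result, and speak it if mail is waiting."""
    while True:
        unread = count_unread(user, password)
        message = announcement(unread)
        print(message)
        if unread:
            subprocess.run(["espeak", message], check=False)
        time.sleep(delay_ms / 1000)
```

Passing the credentials as arguments, rather than editing them into the file, is a choice made for this sketch.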

Downloads: 0 This Week

Last Update: 2016-07-26