Showing 140 open source projects for "audio codec"

View related business solutions
  • Auth0 for AI Agents now in GA Icon
    Auth0 for AI Agents now in GA

    Ready to implement AI with confidence (without sacrificing security)?

    Connect your AI agents to apps and data more securely, give users control over the actions AI agents can perform and the data they can access, and enable human confirmation for critical agent actions.
    Start building today
  • Lightspeed golf course management software Icon
    Lightspeed golf course management software

    Lightspeed Golf is all-in-one golf course management software to help courses simplify operations, drive revenue and deliver amazing golf experiences.

    From tee sheet management, point of sale and payment processing to marketing, automation, reporting and more—Lightspeed is built for the pro shop, restaurant, back office, beverage cart and beyond.
    Learn More
  • 1
    Audiogen Codec

    Audiogen Codec

    48khz stereo neural audio codec for general audio

    AGC (Audiogen Codec) is a convolutional autoencoder based on the DAC architecture, which holds SOTA. We found that training with EMA and adding a perceptual loss term with CLAP features improved performance. These codecs, being low compression, outperform Meta's EnCodec and DAC on general audio as validated from internal blind ELO games. We trained (relatively) very low compression codecs in the pursuit of solving a core issue regarding general music and audio generation, low acoustic quality, and audible artifacts, which hinder industry use for these models. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 2
    AudioCraft

    AudioCraft

    Audiocraft is a library for audio processing and generation

    AudioCraft is a PyTorch library for text-to-audio and text-to-music generation, packaging research models and tooling for training and inference. It includes MusicGen for music generation conditioned on text (and optionally melody) and AudioGen for text-conditioned sound effects and environmental audio. Both models operate over discrete audio tokens produced by a neural codec (EnCodec), which acts like a tokenizer for waveforms and enables efficient sequence modeling. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 3
    WavTokenizer

    WavTokenizer

    SOTA discrete acoustic codec models with 40/75 tokens per second

    WavTokenizer is a state-of-the-art discrete acoustic codec designed specifically for audio language modeling, capable of compressing 24 kHz audio into just 40 or 75 tokens per second while preserving high perceptual quality. It is built to represent speech, music, and general audio with extremely low bitrate, making it ideal as a front-end for large audio language models like GPT-4o and similar architectures.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Moshi

    Moshi

    A speech-text foundation model for real time dialogue

    Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec. Mimi processes 24 kHz audio, down to a 12.5 Hz representation with a bandwidth of 1.1 kbps, in a fully streaming manner (latency of 80ms, the frame size), yet performs better than existing, non-streaming, codecs like SpeechTokenizer (50 Hz, 4kbps), or SemantiCodec (50 Hz, 1.3kbps). Moshi models two streams of audio: one corresponds to Moshi, and the other one to the user. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Repair-CRM Icon
    Repair-CRM

    For small companies that repair and maintenance customer machines

    All-In-One Solution with an Online Booking portal for automating scheduling & dispatching to ditch paperwork and improve the productivity of your technicians!
    Learn More
  • 5
    rtmp-rtsp-stream-client-java

    rtmp-rtsp-stream-client-java

    Library to stream in rtmp and rtsp for Android. All code in Java

    Library for streaming in RTMP and RTSP. All code in Java.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    MMC is a commander-style media player for Windows, with native, hw accelerated video playing and translucent gui. Mpxplay is a console audio player for DOS and Win32 operating systems. x264vfw, x265vfw and xAV1vfw are video for windows encoder and decoder codecs, useful with VirtualDub.
    Leader badge
    Downloads: 267 This Week
    Last Update:
    See Project
  • 7

    opencore-amr

    Audio codecs extracted from Android Open Source Project

    Library of OpenCORE Framework implementation of Adaptive Multi Rate Narrowband and Wideband (AMR-NB and AMR-WB) speech codec. Library of VisualOn implementation of Adaptive Multi Rate Wideband (AMR-WB) encoder and Advanced Audio Coding (AAC) encoder. Modified library of Fraunhofer AAC decoder and encoder.
    Leader badge
    Downloads: 5,168 This Week
    Last Update:
    See Project
  • 8
    VidCoder

    VidCoder

    A Blu-ray, DVD and video file transcoder for Windows

    VidCoder is a Windows-based open-source video transcoding and ripping tool that provides a graphical interface built around standard command-line multimedia tools. It lets users convert video files (or rip DVDs/Blu-rays, when supported) into modern formats and codecs, making it useful for people who want to compress, re-encode, or transcode video content without dealing directly with low-level encoder settings. Because VidCoder integrates and automates the invocation of complex backend...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 9
    BlackBelt CodecPack

    BlackBelt CodecPack

    A clean, lean CoDec Pack. FFDShow and LAV Combined.

    Contains support for popular formats. Works especially well with MediaPortal. LAV, ffdshow - why choose between when you can have both in one pack ! WMV/WMA, DivX, AVI, ASF, FLV, Ogg FLAC, HEV1, x264, x265 etc. NO SPYWARE, NO ADWARE, NO TOOLBARS, NO PLAYER - JUST PURE CODECS Windows XP / Vista / 7 / 8 / 10 - 32/64 bit.
    Downloads: 7 This Week
    Last Update:
    See Project
  • Failed Payment Recovery for Subscription Businesses Icon
    Failed Payment Recovery for Subscription Businesses

    For subscription companies searching for a failed payment recovery solution to grow revenue, and retain customers.

    FlexPay’s innovative platform uses multiple technologies to achieve the highest number of retained customers, resulting in reduced involuntary churn, longer life span after recovery, and higher revenue. Leading brands like LegalZoom, Hooked on Phonics, and ClinicSense trust FlexPay to recover failed payments, reduce churn, and increase customer lifetime value.
    Learn More
  • 10
    Tuniac

    Tuniac

    Tuniac Media Player

    Tuniac is an iTunes style media player/manager for Windows. Advanced playlist editor, search as you type and queue support. Supports: MPEG-1 Audio (mp3, mp2, mp1) FLAC (flac, fla, oga, ogg) Advanced Audio Coding (aac, m4a, m4b, mp4) Apple Lossless Audio Codec aka ALAC (m4a) Windows Media Audio (wma) Vorbis (ogg) Opus (opus) WavPack (wv) TAK Audio (tak) TrueAudio (tta) Monkeys Audio (ape, mac) Musepack (mpc, mp+, mpp) OptimFrog (ofr, ofs) Shorten (shn) Dolby Digital (ac3) DSD (dff, dfs) PCM (wav, aif) CD Audio (cda) Speex (spx) MOD Formats (mod, mo3, xm, it, s3m, mtm) Game Audio Formats (adx, umx) MIDI (mid) Supporting radio streaming of most the above formats.
    Leader badge
    Downloads: 5 This Week
    Last Update:
    See Project
  • 11

    JosePlayer

    Play audio and video files and folders, CDs and DVDs

    It is an awesome multimedia player that plays any multimedia content. You only need to install a free directshow package codec to fully enjoy this media player.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Shutter Encoder

    Shutter Encoder

    Free professional video converter Windows|Mac|Linux

    Shutter Encoder is an video, audio and image converter based on FFmpeg and other great tools. It has been designed by video editors in order to be as accessible and efficient as possible. It's a swiss knife tool for any video editor. Link to website & downloads : https://www.shutterencoder.com - Without conversion: Cut without re-encoding, Replace audio, Rewrap, Conform, Merge, Extract, Subtitling, Video inserts - Sound conversions: WAV, AIFF, FLAC, ALAC, MP3, AAC, AC3, OPUS, OGG - Editing codecs: DNxHD, DNxHR, Apple ProRes, QT Animation, GoPro CineForm, Uncompressed YUV - Output codecs: H.264, H.265, VP8, VP9, AV1, OGV - Broadcast codecs: XDCAM HD422, AVC-Intra 100, XAVC, HAP - Old codecs: DV PAL, MJPEG, Xvid, WMV, MPEG - Archiving codec: FFV1 - Images creation: JPEG, Image - Burn & Rip: DVD, Blu-ray, DVD RIP - Analysis: Loudness & True Peak, Audio normalization, Cut detection, Black detection, Media, VMAF - Download: Web video
    Leader badge
    Downloads: 69 This Week
    Last Update:
    See Project
  • 13
    voxshare_gui

    voxshare_gui

    *VoxShare* is a simple Python-based push-to-talk multicast voice chat

    VoxShare is a simple Python-based push-to-talk multicast voice chat application with a sleek modern GUI built using CustomTkinter. Provided as python source code or compiled standalone windows application (no need to install anything).
    Downloads: 16 This Week
    Last Update:
    See Project
  • 14
    Apprentice Video

    Apprentice Video

    it's a video player, also works for music and pictures

    This player stands on the giant shoulders of FFmpeg. Audio rendering is accomplished via portaudio v19. Video rendering is via OpenGL, using fragment programs when possible. User interface is implemented with Qt 4/5/6. ASS/SSA subtitle rendering is implemented with libass. This player provides several performance options to enable adequate video playback on slow hardware: * skip loop filter * skip non-reference frames * skip color converter * reduce playback speed to accommodate slow video decoding This player supports playback of HDR video on non-HDR displays: * colorspace transform to BT.709 colorspace via an auto-generated 3D LUT * tone mapping from HDR to SDR (BT. 709) This player supports playback of MPEG-TS files containing multiple programs, timeline anomalies, and codec changes. ...
    Leader badge
    Downloads: 15 This Week
    Last Update:
    See Project
  • 15
    howler.js

    howler.js

    Javascript audio library for the modern web

    howler.js is an audio library for the modern web. It defaults to Web Audio API and falls back to HTML5 Audio. This makes working with audio in JavaScript easy and reliable across all platforms. Additional information, live demos and a user showcase are available at howlerjs.com. Single API for all audio needs, defaults to Web Audio API and falls back to HTML5 Audio, handles edge cases and bugs across environments, supports all codecs for full cross-browser support, automatic caching for...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Ant Movie Catalog

    Ant Movie Catalog

    Free program made to manage your collection of movies

    ...Import information from Internet (using scripts); by default it includes scripts for IMDB (US), DVDFR (FR), Allociné (FR) and lots of others. User-customizable links to do a search on movie websites. Information importation from various media files (audio & video codec, bitrates, resolution, framerate, size). Scripting technology, using Object Pascal language, allowing to modify catalog: Find & replace, moving field values, ... Printing, using customizable templates. Export to other formats: HTML (based on a template that you can modify or create yourself), SQL commands (to re-import data in a DBMS such as MySQL), CSV (text files, can be used as tables with Microsft Excel for example). ...
    Downloads: 36 This Week
    Last Update:
    See Project
  • 17
    Aeyae Remux

    Aeyae Remux

    GOP-based video editor (remuxer)

    ...Aeyae Remux can be used to stitch together files... within reason. Since it is only a remuxer -- stitched files must be of compatible format. This means stitched files must have the same audio/video codecs, and codec properties (width, height, sample rate, etc...). That said... Aeyae Remux currently doesn't check or enforce this restriction, so beware -- garbage input will likely produce garbage output. Linux binaries are provided as AppImage, built on Ubuntu 14.04. 8 GB of RAM (but more is better) is highly recommended if working with source files longer than 2 hours.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 18
    ViaVoip

    ViaVoip

    A portable peer to peer voice-chat/walkie-talkie.

    ViaVoip is a simple Voice Over IP application that can be used when you need to talk, chat, or send files through the internet, but you can't or don't want to make use of any third party services. Its peer to peer design allows the two end points to connect directly to each other, without any central server nor account registration. It runs on Windows, Linux, Mac OS X and Android, and is portable, that is you don't need any setup, just get a copy and run it from any storage...
    Downloads: 23 This Week
    Last Update:
    See Project
  • 19

    Virtualdub Batch Video DeShake

    Batch to compress [and deshake] all videos [or images] in folder

    Installation: Execute "DeShakInst.BAT" VirtualDub2 44282; AviSynth+ 3.7.5 updated to C:\DVD DESHAK.BAT updated to C:\UT and added to PATH Usage: DESHAK task[s] [parameters] Tasks: tp1: deshake pass1 LOG generation for 2nd pass tp2: deshake pass2 and compress video and audio to MP3 tcomp: compress (no deshake) twav: extract WAV and/or uses external WAV audio Parameters (more in help): vEXT: video extension (ie: vmov), default: vAVI qN: h264 quality 1-9 (9=lossless), def: q3 (crf23) aN: mp3 quality 1-5, def: a3 (192k) * generates: ZZoriginalname.AVI * inner settings on first lines ie: Audio synch delay (line 46) Min Requirements: XP; Win7x64 for aviSynth video NoiseReduction Klite Mega Codec Pack (with LAME encoder) Other Utilities: LOG2CHAPS.BAT generate _OGG.txt chapters @ scene change VID2AUD.BAT extract Audios VID2MKV.BAT multiplex vid+aud+chapters VIDJOIN.BAT merges videos to MKV
    Leader badge
    Downloads: 5 This Week
    Last Update:
    See Project
  • 20
    VALL-E

    VALL-E

    PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)

    We introduce a language modeling approach for text to speech synthesis (TTS). Specifically, we train a neural codec language model (called VALL-E) using discrete codes derived from an off-the-shelf neural audio codec model, and regard TTS as a conditional language modeling task rather than continuous signal regression as in previous work. During the pre-training stage, we scale up the TTS training data to 60K hours of English speech which is hundreds of times larger than existing systems. ...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 21
    Lyra

    Lyra

    A Very Low-Bitrate Codec for Speech Compression

    lyra is a neural audio codec designed to deliver intelligible, natural-sounding speech at extremely low bitrates, making real-time communication viable on constrained networks. It replaces hand-engineered codecs with learned models that capture speech characteristics more efficiently and reconstruct waveforms with a neural vocoder. The system targets mobile-class hardware, balancing latency and quality so it can run in real-time on phones.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    mplayer-for-windows

    mplayer-for-windows

    MPlayer & MEncoder Builds for Windows

    Leader badge
    Downloads: 146 This Week
    Last Update:
    See Project
  • 23
    EnCodec

    EnCodec

    State-of-the-art deep learning based audio codec

    Encodec is a neural audio codec developed by Meta for high-fidelity, low-bitrate audio compression using end-to-end deep learning. Unlike traditional codecs (like MP3 or Opus), Encodec uses a learned quantizer and decoder to reconstruct complex waveforms with remarkable accuracy at bitrates as low as 1.5 kbps. It employs a convolutional encoder–decoder architecture trained with perceptual loss functions that optimize for human auditory quality rather than raw waveform distance. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    play folder

    play folder

    Player Folder è un semplice ed efficace visore di file multimediali.

    FolderPlayer è un semplice ed efficace visore di file multimediali (audio, video e immagini). Il programma non effettua nessuna installazione di codec, ma utilizza le risorse installate sul PC. Per i formati audio e le immagini è autosufficiente, ma per i video, specie quelli meno comuni, bisogna installare codec di terze parti (se non già presenti). Sistemi operativi supportati : Windows™ x86 / 64bit
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    JAFG - Just Another FFmpeg GUI
    JAFG or Just Another FFmpeg GUI is an interface to FFmpeg. JAFG allows conversion of audio to audio file, conversion of video to video files. JAFG allows changing of the Audio Bitrate, Audio Sampling Rate, Audio Channels, Video Codec, Video Bitrate, Video Size, Aspect, Framerate. JAFG also allows converting to DVD, DV, VCD, SVCD and can be pal, ntsc, film. JAFG allows capture of screenshots and screen recording, and Youtube downloading.
    Downloads: 1 This Week
    Last Update:
    See Project