Showing 65 open source projects for "audio codec"

View related business solutions
  • Auth0 for AI Agents now in GA Icon
    Auth0 for AI Agents now in GA

    Ready to implement AI with confidence (without sacrificing security)?

    Connect your AI agents to apps and data more securely, give users control over the actions AI agents can perform and the data they can access, and enable human confirmation for critical agent actions.
    Start building today
  • Teradata VantageCloud Enterprise is a data analytics platform for performing advanced analytics on AWS, Azure, and Google Cloud. Icon
    Teradata VantageCloud Enterprise is a data analytics platform for performing advanced analytics on AWS, Azure, and Google Cloud.

    Power faster innovation with Teradata VantageCloud

    VantageCloud is the complete cloud analytics and data platform, delivering harmonized data and Trusted AI for all. Built for performance, flexibility, and openness, VantageCloud enables organizations to unify diverse data sources, run complex analytics, and deploy AI models—all within a single, scalable platform.
    Learn More
  • 1
    Audiogen Codec

    Audiogen Codec

    48khz stereo neural audio codec for general audio

    AGC (Audiogen Codec) is a convolutional autoencoder based on the DAC architecture, which holds SOTA. We found that training with EMA and adding a perceptual loss term with CLAP features improved performance. These codecs, being low compression, outperform Meta's EnCodec and DAC on general audio as validated from internal blind ELO games. We trained (relatively) very low compression codecs in the pursuit of solving a core issue regarding general music and audio generation, low acoustic quality, and audible artifacts, which hinder industry use for these models. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 2
    AudioCraft

    AudioCraft

    Audiocraft is a library for audio processing and generation

    AudioCraft is a PyTorch library for text-to-audio and text-to-music generation, packaging research models and tooling for training and inference. It includes MusicGen for music generation conditioned on text (and optionally melody) and AudioGen for text-conditioned sound effects and environmental audio. Both models operate over discrete audio tokens produced by a neural codec (EnCodec), which acts like a tokenizer for waveforms and enables efficient sequence modeling. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 3
    WavTokenizer

    WavTokenizer

    SOTA discrete acoustic codec models with 40/75 tokens per second

    WavTokenizer is a state-of-the-art discrete acoustic codec designed specifically for audio language modeling, capable of compressing 24 kHz audio into just 40 or 75 tokens per second while preserving high perceptual quality. It is built to represent speech, music, and general audio with extremely low bitrate, making it ideal as a front-end for large audio language models like GPT-4o and similar architectures.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Moshi

    Moshi

    A speech-text foundation model for real time dialogue

    Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec. Mimi processes 24 kHz audio, down to a 12.5 Hz representation with a bandwidth of 1.1 kbps, in a fully streaming manner (latency of 80ms, the frame size), yet performs better than existing, non-streaming, codecs like SpeechTokenizer (50 Hz, 4kbps), or SemantiCodec (50 Hz, 1.3kbps). Moshi models two streams of audio: one corresponds to Moshi, and the other one to the user. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Zendesk: The Complete Customer Service Solution Icon
    Zendesk: The Complete Customer Service Solution

    Discover AI-powered, award-winning customer service software trusted by 200k customers

    Equip your agents with powerful AI tools and workflows that boost efficiency and elevate customer experiences across every channel.
    Learn More
  • 5
    rtmp-rtsp-stream-client-java

    rtmp-rtsp-stream-client-java

    Library to stream in rtmp and rtsp for Android. All code in Java

    Library for streaming in RTMP and RTSP. All code in Java.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6

    opencore-amr

    Audio codecs extracted from Android Open Source Project

    Library of OpenCORE Framework implementation of Adaptive Multi Rate Narrowband and Wideband (AMR-NB and AMR-WB) speech codec. Library of VisualOn implementation of Adaptive Multi Rate Wideband (AMR-WB) encoder and Advanced Audio Coding (AAC) encoder. Modified library of Fraunhofer AAC decoder and encoder.
    Leader badge
    Downloads: 4,694 This Week
    Last Update:
    See Project
  • 7
    Shutter Encoder

    Shutter Encoder

    Free professional video converter Windows|Mac|Linux

    Shutter Encoder is an video, audio and image converter based on FFmpeg and other great tools. It has been designed by video editors in order to be as accessible and efficient as possible. It's a swiss knife tool for any video editor. Link to website & downloads : https://www.shutterencoder.com - Without conversion: Cut without re-encoding, Replace audio, Rewrap, Conform, Merge, Extract, Subtitling, Video inserts - Sound conversions: WAV, AIFF, FLAC, ALAC, MP3, AAC, AC3, OPUS, OGG - Editing codecs: DNxHD, DNxHR, Apple ProRes, QT Animation, GoPro CineForm, Uncompressed YUV - Output codecs: H.264, H.265, VP8, VP9, AV1, OGV - Broadcast codecs: XDCAM HD422, AVC-Intra 100, XAVC, HAP - Old codecs: DV PAL, MJPEG, Xvid, WMV, MPEG - Archiving codec: FFV1 - Images creation: JPEG, Image - Burn & Rip: DVD, Blu-ray, DVD RIP - Analysis: Loudness & True Peak, Audio normalization, Cut detection, Black detection, Media, VMAF - Download: Web video
    Leader badge
    Downloads: 73 This Week
    Last Update:
    See Project
  • 8
    Apprentice Video

    Apprentice Video

    it's a video player, also works for music and pictures

    This player stands on the giant shoulders of FFmpeg. Audio rendering is accomplished via portaudio v19. Video rendering is via OpenGL, using fragment programs when possible. User interface is implemented with Qt 4/5/6. ASS/SSA subtitle rendering is implemented with libass. This player provides several performance options to enable adequate video playback on slow hardware: * skip loop filter * skip non-reference frames * skip color converter * reduce playback speed to accommodate slow video decoding This player supports playback of HDR video on non-HDR displays: * colorspace transform to BT.709 colorspace via an auto-generated 3D LUT * tone mapping from HDR to SDR (BT. 709) This player supports playback of MPEG-TS files containing multiple programs, timeline anomalies, and codec changes. ...
    Leader badge
    Downloads: 14 This Week
    Last Update:
    See Project
  • 9
    howler.js

    howler.js

    Javascript audio library for the modern web

    howler.js is an audio library for the modern web. It defaults to Web Audio API and falls back to HTML5 Audio. This makes working with audio in JavaScript easy and reliable across all platforms. Additional information, live demos and a user showcase are available at howlerjs.com. Single API for all audio needs, defaults to Web Audio API and falls back to HTML5 Audio, handles edge cases and bugs across environments, supports all codecs for full cross-browser support, automatic caching for...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Top Corporate LMS for Training | Best Learning Management Software Icon
    Top Corporate LMS for Training | Best Learning Management Software

    Deliver and Track Online Training and Stay Compliant - with Axis LMS!

    Axis LMS enables you to deliver online and virtual learning and training through a scalable, easy-to-use LMS that is designed to enhance your training, automate your workflows, engage your learners and keep you compliant.
    Learn More
  • 10
    Aeyae Remux

    Aeyae Remux

    GOP-based video editor (remuxer)

    ...Aeyae Remux can be used to stitch together files... within reason. Since it is only a remuxer -- stitched files must be of compatible format. This means stitched files must have the same audio/video codecs, and codec properties (width, height, sample rate, etc...). That said... Aeyae Remux currently doesn't check or enforce this restriction, so beware -- garbage input will likely produce garbage output. Linux binaries are provided as AppImage, built on Ubuntu 14.04. 8 GB of RAM (but more is better) is highly recommended if working with source files longer than 2 hours.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 11
    ViaVoip

    ViaVoip

    A portable peer to peer voice-chat/walkie-talkie.

    ViaVoip is a simple Voice Over IP application that can be used when you need to talk, chat, or send files through the internet, but you can't or don't want to make use of any third party services. Its peer to peer design allows the two end points to connect directly to each other, without any central server nor account registration. It runs on Windows, Linux, Mac OS X and Android, and is portable, that is you don't need any setup, just get a copy and run it from any storage...
    Downloads: 24 This Week
    Last Update:
    See Project
  • 12

    Virtualdub Batch Video DeShake

    Batch to compress [and deshake] all videos [or images] in folder

    Installation: Execute "DeShakInst.BAT" VirtualDub2 44282; AviSynth+ 3.7.5 updated to C:\DVD DESHAK.BAT updated to C:\UT and added to PATH Usage: DESHAK task[s] [parameters] Tasks: tp1: deshake pass1 LOG generation for 2nd pass tp2: deshake pass2 and compress video and audio to MP3 tcomp: compress (no deshake) twav: extract WAV and/or uses external WAV audio Parameters (more in help): vEXT: video extension (ie: vmov), default: vAVI qN: h264 quality 1-9 (9=lossless), def: q3 (crf23) aN: mp3 quality 1-5, def: a3 (192k) * generates: ZZoriginalname.AVI * inner settings on first lines ie: Audio synch delay (line 46) Min Requirements: XP; Win7x64 for aviSynth video NoiseReduction Klite Mega Codec Pack (with LAME encoder) Other Utilities: LOG2CHAPS.BAT generate _OGG.txt chapters @ scene change VID2AUD.BAT extract Audios VID2MKV.BAT multiplex vid+aud+chapters VIDJOIN.BAT merges videos to MKV
    Leader badge
    Downloads: 3 This Week
    Last Update:
    See Project
  • 13
    VALL-E

    VALL-E

    PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)

    We introduce a language modeling approach for text to speech synthesis (TTS). Specifically, we train a neural codec language model (called VALL-E) using discrete codes derived from an off-the-shelf neural audio codec model, and regard TTS as a conditional language modeling task rather than continuous signal regression as in previous work. During the pre-training stage, we scale up the TTS training data to 60K hours of English speech which is hundreds of times larger than existing systems. ...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 14
    Lyra

    Lyra

    A Very Low-Bitrate Codec for Speech Compression

    lyra is a neural audio codec designed to deliver intelligible, natural-sounding speech at extremely low bitrates, making real-time communication viable on constrained networks. It replaces hand-engineered codecs with learned models that capture speech characteristics more efficiently and reconstruct waveforms with a neural vocoder. The system targets mobile-class hardware, balancing latency and quality so it can run in real-time on phones.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    EnCodec

    EnCodec

    State-of-the-art deep learning based audio codec

    Encodec is a neural audio codec developed by Meta for high-fidelity, low-bitrate audio compression using end-to-end deep learning. Unlike traditional codecs (like MP3 or Opus), Encodec uses a learned quantizer and decoder to reconstruct complex waveforms with remarkable accuracy at bitrates as low as 1.5 kbps. It employs a convolutional encoder–decoder architecture trained with perceptual loss functions that optimize for human auditory quality rather than raw waveform distance. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    AlphaPlayer

    AlphaPlayer

    AlphaPlayer is a video animation engine

    AlphaPlayer is positioned as a multimedia or media-player library or application under ByteDance, likely intended to provide video/audio playback functionality, streaming, or media rendering capabilities. It probably serves as a foundation for building media-heavy applications — offering features like playback control, streaming support, adaptive media handling, and possibly integration with custom codecs or streaming protocols. For developers building web, desktop, or mobile applications...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    FFME

    FFME

    FFME: The Advanced WPF MediaElement (based on FFmpeg)

    FFME: The Advanced WPF MediaElement (based on FFmpeg). FFME is an advanced and close drop-in replacement for Microsoft's WPF MediaElement Control. While the standard MediaElement uses DirectX (DirectShow) for media playback, FFME uses FFmpeg to read and decode audio and video. This means that for those of you who want to support stuff like HLS playback, or just don't want to go through the hassle of installing codecs on client machines, using FFME might just be the answer. Fast media seeking...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Macbuntu BIGSur 2021

    Macbuntu BIGSur 2021

    Macbuntu BIGSur 2021.6 x64 bit Kernel 5.11.18 on xubuntu lts 21.04

    New features: Mozilla Firefox, Anydesk remote control, Vlc player, Pinta, Gimp Photoshop, Windows font pack, Grubcostomizer, macOSX BIGSur Wallpaper icon,theme and cursor packs,ULauncher ctrl+space search, Guake terminal F12, Neofetch macOSX, Bleachbit system cleaner, Plankdock with whitedark-whitelight theme. About? WhiteSUR theme icon packs, macosx mouse cursor, original hd wallpapers added. The XFCE Desktop interface provides a comfortable and fast use for any computer. Login,...
    Downloads: 24 This Week
    Last Update:
    See Project
  • 19
    TNCLinux2021

    TNCLinux2021

    TNCLinux-2021.2-x64 3D CompizFusion Kernel 5.10.18

    TNCLinux 2021-2 x64 Base: Ubuntu 20.04.2 LTS Long term support Architecture: Amd64 / x64 Bit processor support Legacy-Uefi Language: Multilanguage / Turkish Kernel: 5.10.18 Xanmod Modify Kernel Popular applications, essential for first and end users, are integrated. Xanmod modified fast kernel is used. It constantly receives updates with the system. It has multimedia, office, system and customized content and applications.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    FFmpeg libav tutorial

    FFmpeg libav tutorial

    FFmpeg libav tutorial

    ...FFmpeg libav has lots of bindings for many languages like python, go and even if your language doesn't have it, you can still support it through the ffi (here's an example with Lua). We'll start with a quick lesson about what is video, audio, codec and container and then we'll go to a crash course on how to use FFmpeg command line and finally we'll write code, feel free to skip directly to the section Learn FFmpeg libav the Hard Way. Some people used to say that the Internet video streaming is the future of the traditional TV, in any case, the FFmpeg is something that is worth studying.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Freeware Advanced Audio Codec
    FAAC is an Advanced Audio Coder (MPEG2-AAC, MPEG4-AAC). The goal of FAAC is to explore the possibilities of AAC and exceed the quality of the currently best MP3 encoders. You can submit pull requests via GitHub: https://github.com/knik0/faac https://github.com/knik0/faad2
    Leader badge
    Downloads: 913 This Week
    Last Update:
    See Project
  • 22
    PlusOS (Plus Operating System)

    PlusOS (Plus Operating System)

    OS bootable from Linux,Windows and others partitions as auxiliary OS

    ...That's built to run as auxiliary OS with Major OS like Windows and Linux etc..bootable with grub2 and own boot loader.focused on eBook reading software(PDF,EPUB,MOBI,DJVU,CHM...).with programs like calculator, notepad,source code editor, hex viewer, text viewer,image viewer and many games.VESA 32bpp GUI from startup.read-only plus maybe writing support for standard file systems FAT/NTFS/EXT/CDFS.codec for archives(ZIP,RAR,7z,GZ ...).codec for image formats(JPG,BMP,GIF,PNG...).video player or at least thumbnails extract(AVI,MKV,MP4,3GP...). mountable as filesystem from ISO,ISZ file.generic drivers VBE,USB,HD Audio,Keyboard, mouse.bootable on qemu and of course on real computer systems.at least all drivers for one computer (my) this is primary goal. In summary goal is building a stable and usefull OS. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 23
    WebRTC Android

    WebRTC Android

    webrtc VideoCall VideoConference

    WebRTC Android is a sample project that showcases how to implement real-time peer-to-peer communication (audio, video, and data) on Android using WebRTC. It demonstrates a complete pipeline—from signaling and connection setup to media capture and transmission—making it an excellent reference for developers looking to integrate WebRTC into their mobile apps. The project includes UI components and handles network state changes, codec configuration, and ICE negotiation to provide a robust base for real-world applications.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 24

    FastoCloud PRO

    IPTV/NVR/CCTV/Video cloud https://fastocloud.com

    IPTV/Video cloud Features: Cross-platform (Linux, MacOSX, FreeBSD, Raspbian/Armbian) GPU/CPU Encode/Decode/Post Processing Stream statistics CCTV Adaptive hls streams Load balancing Temporary urls HLS push EPG scanning Subtitles to text conversions AD insertion Logo overlay Video effects Relays Timeshifts Catchups Playlists Restream/Transcode from online streaming services like Youtube, Twitch ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 25
    pulseaudio-modules-bt

    pulseaudio-modules-bt

    PulseAudio bluetooth modules

    Adds Sony LDAC, aptX, aptX HD, AAC codecs (A2DP Audio) support to PulseAudio on Linux. This repo is a fork of PulseAudio Bluetooth modules. Added support for manual (expert) configuration for SBC codec parameters. You can use the parameters to override and fine-tune the default SBC codec config and manually setup configurations like SBC XQ, or Dual Channel HD mode. More info about SBC configuration options can be found at LineageOS documentation.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • Next