TarsosDSP: A Real-Time Audio Processing Framework in Java


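"TarsosDSP is a real-time audio processing framework written in Java, suited to audio analysis, processing, and feature extraction. It offers analysis, processing, and feature extraction together in real time, a combination that is unique in the Java ecosystem. The framework contains practical audio processing algorithms, is easy to extend, and has no external dependencies. Its main features include a resampling algorithm, onset detectors, several pitch estimation algorithms, a time stretch algorithm, a pitch shifting algorithm, and an algorithm to calculate the Constant-Q."

TarsosDSP was developed by Joren Six, Olmo Cornelis, and Marc Leman. It is well suited to beginners: besides a rich set of audio processing functions, an accompanying English-language paper supports deeper study. Although implemented in Java, the framework is designed for real-time audio analysis and processing tasks.

On the analysis side, TarsosDSP includes onset detectors, which mark the start of events such as notes or beats in an audio signal. It also provides several pitch estimation algorithms, which matter in music analysis and speech processing (for example automatic music annotation and speech recognition), so users can pick the method that best fits their application.

Time stretching and pitch shifting cover audio editing and transformation. Time stretching changes the speed of audio without changing its pitch, while pitch shifting changes the pitch while keeping the original playback speed. Both are widely used in music production, film scoring, and speech synthesis.

TarsosDSP also includes a Constant-Q algorithm. The Constant-Q is a spectral representation common in music analysis whose frequency bins are geometrically spaced, so that resolution is even across musical intervals, which helps when examining musical structure.

The framework is designed for ease of use and extensibility. Each algorithm is built as a simple processing pipeline, so developers can add custom algorithms or integrate the framework into existing projects. Having no external dependencies keeps TarsosDSP lightweight and quick to deploy on desktop or embedded systems.

In summary, TarsosDSP brings together multiple real-time audio processing algorithms, from basic signal processing to higher-level music analysis, and is a useful resource for academic research, software development, and audio engineering.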

TarsosDSP, a Real-Time Audio Processing Framework in Java

Joren Six (1,2), Olmo Cornelis, Marc Leman

(1) University Ghent, IPEM, Sint-Pietersnieuwstraat 41, 9000, Gent, Belgium

(2) University College Ghent, School of Arts, Jozef Kluyskensstraat 2, 9000 Gent, Belgium

Correspondence should be addressed to Joren Six ([email protected])

ABSTRACT

This paper presents TarsosDSP, a framework for real-time audio analysis and processing. Most libraries and frameworks offer either audio analysis and feature extraction or audio synthesis and processing. TarsosDSP is one of only a few frameworks that offer analysis, processing, and feature extraction in real-time, a unique feature in the Java ecosystem. The framework contains practical audio processing algorithms, it can be extended easily, and has no external dependencies. Each algorithm is implemented as simply as possible thanks to a straightforward processing pipeline. TarsosDSP's features include a resampling algorithm, onset detectors, a number of pitch estimation algorithms, a time stretch algorithm, a pitch shifting algorithm, and an algorithm to calculate the Constant-Q. The framework also allows simple audio synthesis, some audio effects, and several filters. The Open Source framework is a valuable contribution to the MIR-Community and an ideal fit for interactive MIR-applications on Android.
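The Constant-Q calculation listed among the features rests on geometrically spaced frequency bins. The sketch below only illustrates that spacing, assuming the common definition in which bin k is centered at fMin * 2^(k / binsPerOctave) and every bin has the same ratio Q of center frequency to bandwidth. The class name, method names, and parameter values are hypothetical and are not taken from the TarsosDSP API.

```java
import java.util.Locale;

/** Hypothetical helper for illustration; not part of the TarsosDSP API. */
final class ConstantQSketch {

    /** Center frequency of bin k: fMin * 2^(k / binsPerOctave). */
    static double centerFrequency(double fMin, int binsPerOctave, int k) {
        return fMin * Math.pow(2.0, (double) k / binsPerOctave);
    }

    /** Ratio of center frequency to bin spacing; the same for every bin. */
    static double qualityFactor(int binsPerOctave) {
        return 1.0 / (Math.pow(2.0, 1.0 / binsPerOctave) - 1.0);
    }

    public static void main(String[] args) {
        double fMin = 55.0;     // assumed lowest analysis frequency in Hz
        int binsPerOctave = 12; // assumed: one bin per semitone
        double q = qualityFactor(binsPerOctave);
        for (int k = 0; k <= 24; k += 6) {
            double f = centerFrequency(fMin, binsPerOctave, k);
            // Bandwidth grows with frequency so that f / bandwidth stays equal to q.
            System.out.printf(Locale.ROOT,
                    "bin %2d: center %.2f Hz, bandwidth %.2f Hz%n", k, f, f / q);
        }
    }
}
```

With these assumed values, bins one octave apart differ by a factor of two in both center frequency and bandwidth, which is what distinguishes this layout from the evenly spaced bins of a plain FFT.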

1. INTRODUCTION

Frameworks or libraries for audio processing can be divided into two categories. The first category offers audio analysis and feature extraction. The second category offers audio synthesis capabilities. Both types may or may not operate in real-time. Table 1 shows a partial overview of notable audio frameworks. It shows that only a few frameworks offer real-time feature extraction combined with synthesis capabilities. To the best of the authors' knowledge, TarsosDSP is unique in that regard within the Java ecosystem. The combination of real-time feature extraction and synthesis can be of use for music education tools or music video games. Especially for development on the Android platform there is a need for such functionality.
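One way to picture analysis and processing sharing a single real-time path is a chain of steps applied to each block of samples. The following is a minimal sketch under that assumption; the interface and class names (BlockProcessor, Gain, RmsMeter, PipelineSketch) are hypothetical, are not the TarsosDSP API, and use only the Java standard library.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

/** One step in the chain; hypothetical interface, not the TarsosDSP API. */
interface BlockProcessor {
    /** Works on the block in place; returning false skips the remaining steps. */
    boolean process(float[] block);
}

/** Processing step: scales every sample by a fixed gain. */
final class Gain implements BlockProcessor {
    private final float gain;
    Gain(float gain) { this.gain = gain; }
    public boolean process(float[] block) {
        for (int i = 0; i < block.length; i++) block[i] *= gain;
        return true;
    }
}

/** Analysis step: stores the root mean square of the most recent block. */
final class RmsMeter implements BlockProcessor {
    double lastRms;
    public boolean process(float[] block) {
        double sum = 0;
        for (float s : block) sum += s * s;
        lastRms = Math.sqrt(sum / block.length);
        return true;
    }
}

final class PipelineSketch {
    private final List<BlockProcessor> chain = new ArrayList<>();

    void add(BlockProcessor p) { chain.add(p); }

    /** Cuts the signal into full blocks and sends a copy of each through the chain. */
    void run(float[] signal, int blockSize) {
        for (int start = 0; start + blockSize <= signal.length; start += blockSize) {
            float[] block = Arrays.copyOfRange(signal, start, start + blockSize);
            for (BlockProcessor p : chain) {
                if (!p.process(block)) break;
            }
        }
    }

    public static void main(String[] args) {
        // Assumed test input: 1024 samples of a 440 Hz sine at 44100 Hz.
        float[] signal = new float[1024];
        for (int i = 0; i < signal.length; i++) {
            signal[i] = (float) Math.sin(2 * Math.PI * 440.0 * i / 44100.0);
        }
        RmsMeter meter = new RmsMeter();
        PipelineSketch pipeline = new PipelineSketch();
        pipeline.add(new Gain(0.5f)); // processing
        pipeline.add(meter);          // feature extraction on the processed block
        pipeline.run(signal, 256);
        System.out.println("RMS of last block after gain: " + meter.lastRms);
    }
}
```

Because every step shares one small interface, adding another analysis or effect step means writing one class and one `add` call; a real-time version would read blocks from an audio input instead of an array.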

TarsosDSP also fills a need for educational tools for Music Information Retrieval. As identified by Gomez in [14], there is a need for comprehensible, well-documented MIR-frameworks which perform useful tasks on every platform, without the requirement of a costly software package like Matlab. TarsosDSP serves this educational goal; it has already been used by several master students as a starting point into music information retrieval [5, 32, 28].

Footnote to "Frameworks or libraries": The distinction between library and framework is explained in [2]. In short, a framework is an abstract specification of an application whereby analysis and design are reused; conversely, when using a (class) library, code is reused but the library does not enforce a design.

The framework tries to hit the sweet spot between being capable enough to get real tasks done, and compact enough to serve as a demonstration for beginning MIR-researchers on how audio processing works in practice. TarsosDSP therefore targets both students and more experienced researchers who want to make use of the implemented features.

After this introduction, a section on the design decisions follows; then the main features of TarsosDSP are highlighted. Section four covers the availability of the framework. The paper ends with a conclusion and future work.

2. DESIGN DECISIONS

To meet the goals stated in the introduction, a couple of design decisions were made.

2.1. Java based

TarsosDSP was written in Java to allow portability from one platform to another. The automatic memory management facilities are a great boon for a system implemented in Java. These features allow a clean implementation

AES 53rd International Conference, London, UK, 2014 January 27–29
