Cong Han 0001

Person information

affiliation: Columbia University, Department of Electrical Engineering, New York, NY, USA

2020 – today

2026
[i24]
Xilin Jiang, Qiaolin Wang, Junkai Wu, Xiaomin He, Zhongweiyang Xu, Yinghao Ma, Minshuo Piao, Kaiyi Yang, Xiuwen Zheng, Riki Shimizu, Yicong Chen, Arsalan Firoozi, Gavin Mischler, Sukru Samet Dindar, Richard J. Antonello, Linyang He, Tsun-An Hsieh, Xulin Fan, Yulun Wu, Yuesheng Ma, Chaitanya Amballa, Weixiong Chen, Jiarui Hai, Ruisi Li, Vishal Choudhari, Cong Han, Yinghao Aaron Li, Adeen Flinker, Mounya Elhilali, Emmanouil Benetos, Mark Hasegawa-Johnson, Romit Roy Choudhury, Nima Mesgarani:
AVMeme Exam: A Multimodal Multilingual Multicultural Benchmark for LLMs' Contextual and Cultural Knowledge and Thinking. CoRR abs/2601.17645 (2026)
2025
[j3]
Yinghao Aaron Li, Cong Han, Nima Mesgarani:
StyleTTS: A Style-Based Generative Model for Natural and Diverse Text-to-Speech Synthesis. IEEE J. Sel. Top. Signal Process. 19(1): 283-296 (2025)
[j2]
Xilin Jiang, Cong Han, Yinghao Aaron Li, Nima Mesgarani:
Listen, Chat, and Remix: Text-Guided Soundscape Remixing for Enhanced Auditory Experience. IEEE J. Sel. Top. Signal Process. 19(4): 635-645 (2025)
[c24]
Xilin Jiang, Cong Han, Nima Mesgarani:
Dual-path Mamba: Short and Long-term Bidirectional Selective Structured State Space Models for Speech Separation. ICASSP 2025: 1-5
[c23]
Xilin Jiang, Yinghao Aaron Li, Adrian Nicolas Florea, Cong Han, Nima Mesgarani:
Speech Slytherin: Examining the Performance and Efficiency of Mamba for Speech Separation, Recognition, and Synthesis. ICASSP 2025: 1-5
[c22]
Yinghao Aaron Li, Xilin Jiang, Cong Han, Nima Mesgarani:
StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion. NAACL (Long Papers) 2025: 4725-4744
2024
[c21]
Cong Han, Kevin W. Wilson, Scott Wisdom, John R. Hershey:
Unsupervised Multi-Channel Separation And Adaptation. ICASSP 2024: 721-725
[c20]
Xilin Jiang, Cong Han, Yinghao Aaron Li, Nima Mesgarani:
Exploring Self-supervised Contrastive Learning of Spatial Sound Event Representation. ICASSP 2024: 1281-1285
[i23]
Xilin Jiang, Cong Han, Yinghao Aaron Li, Nima Mesgarani:
Listen, Chat, and Edit: Text-Guided Soundscape Modification for Enhanced Auditory Experience. CoRR abs/2402.03710 (2024)
[i22]
Xilin Jiang, Cong Han, Nima Mesgarani:
Dual-path Mamba: Short and Long-term Bidirectional Selective Structured State Space Models for Speech Separation. CoRR abs/2403.18257 (2024)
[i21]
Xilin Jiang, Yinghao Aaron Li, Adrian Nicolas Florea, Cong Han, Nima Mesgarani:
Speech Slytherin: Examining the Performance and Efficiency of Mamba for Speech Separation, Recognition, and Synthesis. CoRR abs/2407.09732 (2024)
[i20]
Yinghao Aaron Li, Xilin Jiang, Cong Han, Nima Mesgarani:
StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion. CoRR abs/2409.10058 (2024)
2023
[c19]
Cong Han, Vishal Choudhari, Yinghao Aaron Li, Nima Mesgarani:
Improved Decoding of Attentional Selection in Multi-Talker Environments with Self-Supervised Learned Speech Representation. EMBC 2023: 1-5
[c18]
Cong Han, Nima Mesgarani:
Online Binaural Speech Separation Of Moving Speakers With A Wavesplit Network. ICASSP 2023: 1-5
[c17]
Yinghao Aaron Li, Cong Han, Xilin Jiang, Nima Mesgarani:
Phoneme-Level BERT for Enhanced Prosody of Text-To-Speech with Grapheme Predictions. ICASSP 2023: 1-5
[c16]
Yinghao Aaron Li, Cong Han, Vinay S. Raghavan, Gavin Mischler, Nima Mesgarani:
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models. NeurIPS 2023
[c15]
Yinghao Aaron Li, Cong Han, Nima Mesgarani:
SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs. WASPAA 2023: 1-5
[i19]
Yinghao Aaron Li, Cong Han, Xilin Jiang, Nima Mesgarani:
Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions. CoRR abs/2301.08810 (2023)
[i18]
Cong Han, Vishal Choudhari, Yinghao Aaron Li, Nima Mesgarani:
Improved Decoding of Attentional Selection in Multi-Talker Environments with Self-Supervised Learned Speech Representation. CoRR abs/2302.05756 (2023)
[i17]
Cong Han, Nima Mesgarani:
Online Binaural Speech Separation of Moving Speakers With a Wavesplit Network. CoRR abs/2303.07458 (2023)
[i16]
Cong Han, Kevin W. Wilson, Scott Wisdom, John R. Hershey:
Unsupervised Multi-channel Separation and Adaptation. CoRR abs/2305.11151 (2023)
[i15]
Yinghao Aaron Li, Cong Han, Vinay S. Raghavan, Gavin Mischler, Nima Mesgarani:
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models. CoRR abs/2306.07691 (2023)
[i14]
Yinghao Aaron Li, Cong Han, Nima Mesgarani:
SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs. CoRR abs/2307.09435 (2023)
[i13]
Yinghao Aaron Li, Cong Han, Xilin Jiang, Nima Mesgarani:
HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform. CoRR abs/2309.09493 (2023)
[i12]
Xilin Jiang, Cong Han, Yinghao Aaron Li, Nima Mesgarani:
Exploring Self-Supervised Contrastive Learning of Spatial Sound Event Representation. CoRR abs/2309.15938 (2023)
2022
[c14]
Cong Han, Emine Merve Kaya, Kyle Hoefer, Malcolm Slaney, Simon Carlile:
Multi-Channel Speech Denoising for Machine Ears. ICASSP 2022: 276-280
[c13]
Bowen Yang, Cong Han, Yu Li, Lei Zuo, Zhou Yu:
Improving Conversational Recommendation Systems' Quality with Context-Aware Item Meta-Information. NAACL-HLT (Findings) 2022: 38-48
[c12]
Yinghao Aaron Li, Cong Han, Nima Mesgarani:
StyleTTS-VC: One-Shot Voice Conversion by Knowledge Transfer From Style-Based TTS Models. SLT 2022: 920-927
[i11]
Cong Han, Emine Merve Kaya, Kyle Hoefer, Malcolm Slaney, Simon Carlile:
Multi-Channel Speech Denoising for Machine Ears. CoRR abs/2202.08793 (2022)
[i10]
Yinghao Aaron Li, Cong Han, Nima Mesgarani:
StyleTTS: A Style-Based Generative Model for Natural and Diverse Text-to-Speech Synthesis. CoRR abs/2205.15439 (2022)
[i9]
Yinghao Aaron Li, Cong Han, Nima Mesgarani:
StyleTTS-VC: One-Shot Voice Conversion by Knowledge Transfer from Style-Based TTS Models. CoRR abs/2212.14227 (2022)
2021
[j1]
Yi Luo, Cong Han, Nima Mesgarani:
Group Communication With Context Codec for Lightweight Source Separation. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1752-1761 (2021)
[c11]
Yi Luo, Zhuo Chen, Cong Han, Chenda Li, Tianyan Zhou, Nima Mesgarani:
Rethinking The Separation Layers In Speech Separation Networks. ICASSP 2021: 1-5
[c10]
Yi Luo, Cong Han, Nima Mesgarani:
Ultra-Lightweight Speech Separation Via Group Communication. ICASSP 2021: 16-20
[c9]
Chenda Li, Zhuo Chen, Yi Luo, Cong Han, Tianyan Zhou, Keisuke Kinoshita, Marc Delcroix, Shinji Watanabe, Yanmin Qian:
Dual-Path Modeling for Long Recording Speech Separation in Meetings. ICASSP 2021: 5739-5743
[c8]
Cong Han, Yi Luo, Chenda Li, Tianyan Zhou, Keisuke Kinoshita, Shinji Watanabe, Marc Delcroix, Hakan Erdogan, John R. Hershey, Nima Mesgarani, Zhuo Chen:
Continuous Speech Separation Using Speaker Inventory for Long Recording. Interspeech 2021: 3036-3040
[c7]
Yi Luo, Cong Han, Nima Mesgarani:
Empirical Analysis of Generalized Iterative Speech Separation Networks. Interspeech 2021: 3485-3489
[c6]
Cong Han, Yi Luo, Nima Mesgarani:
Binaural Speech Separation of Moving Speakers With Preserved Spatial Cues. Interspeech 2021: 3505-3509
[c5]
Yi Luo, Cong Han, Nima Mesgarani:
Distortion-Controlled Training for end-to-end Reverberant Speech Separation with Auxiliary Autoencoding Loss. SLT 2021: 825-832
[c4]
Chenda Li, Yi Luo, Cong Han, Jinyu Li, Takuya Yoshioka, Tianyan Zhou, Marc Delcroix, Keisuke Kinoshita, Christoph Böddeker, Yanmin Qian, Shinji Watanabe, Zhuo Chen:
Dual-Path RNN for Long Recording Speech Separation. SLT 2021: 865-872
[i8]
Chenda Li, Zhuo Chen, Yi Luo, Cong Han, Tianyan Zhou, Keisuke Kinoshita, Marc Delcroix, Shinji Watanabe, Yanmin Qian:
Dual-Path Modeling for Long Recording Speech Separation in Meetings. CoRR abs/2102.11634 (2021)
[i7]
Bowen Yang, Cong Han, Yu Li, Lei Zuo, Zhou Yu:
Improving Conversational Recommendation Systems' Quality with Context-Aware Item Meta Information. CoRR abs/2112.08140 (2021)
2020
[c3]
Cong Han, Yi Luo, Nima Mesgarani:
Real-Time Binaural Speech Separation with Preserved Spatial Cues. ICASSP 2020: 6404-6408
[i6]
Cong Han, Yi Luo, Nima Mesgarani:
Real-time binaural speech separation with preserved spatial cues. CoRR abs/2002.06637 (2020)
[i5]
Yi Luo, Cong Han, Nima Mesgarani:
Ultra-Lightweight Speech Separation via Group Communication. CoRR abs/2011.08397 (2020)
[i4]
Yi Luo, Zhuo Chen, Cong Han, Chenda Li, Tianyan Zhou, Nima Mesgarani:
Rethinking the Separation Layers in Speech Separation Networks. CoRR abs/2011.08400 (2020)
[i3]
Yi Luo, Cong Han, Nima Mesgarani:
Group Communication with Context Codec for Ultra-Lightweight Source Separation. CoRR abs/2012.07291 (2020)
[i2]
Cong Han, Yi Luo, Chenda Li, Tianyan Zhou, Keisuke Kinoshita, Shinji Watanabe, Marc Delcroix, Hakan Erdogan, John R. Hershey, Nima Mesgarani, Zhuo Chen:
Continuous Speech Separation Using Speaker Inventory for Long Multi-talker Recording. CoRR abs/2012.09727 (2020)

2010 – 2019

2019
[c2]
Yi Luo, Cong Han, Nima Mesgarani, Enea Ceolini, Shih-Chii Liu:
FaSNet: Low-Latency Adaptive Beamforming for Multi-Microphone Audio Processing. ASRU 2019: 260-267
[c1]
Cong Han, Yi Luo, Nima Mesgarani:
Online Deep Attractor Network for Real-time Single-channel Speech Separation. ICASSP 2019: 361-365
[i1]
Yi Luo, Enea Ceolini, Cong Han, Shih-Chii Liu, Nima Mesgarani:
FaSNet: Low-latency Adaptive Beamforming for Multi-microphone Audio Processing. CoRR abs/1909.13387 (2019)
