Ph.D. Candidate in Computer Science and Cognitive Science at Indiana University.
Research Group: Signals & Artificial Intelligence Group in Engineering (SAIGE)
Email: ZHenK At IU doT EdU
LinkedIn profile
I conduct research on audio and acoustic signal processing in the current deep/machine learning paradigm, with the focus on both model capacity and efficiency. Concretely, I've been working on cross-module residual learning that is compatible with both advanced, fast changing data-driven modules and conventional methodologies in audiology for lightweight speech coding. In terms of monaural speech enhancement, we proposed a hybrid architecture incorporating both CNN and RNN in a densely connected manner to enable dual-level context aggregation, efficiently. Besides, I worked a psychoacoustically weighting scheme to prioritize the model training towards an energy efficient speech denoising autoencoder. My supervisor is Prof. Minje Kim.
Kai Zhen, Jongmo Sung, Mi Suk Lee, Seungkwon Beack, and Minje Kim, "EFFICIENT AND SCALABLE NEURAL RESIDUAL WAVEFORM CODING WITH COLLABORATIVE QUANTIZATION" [Demo]
Kai Zhen, Mi Suk Lee, Minje Kim, "A DUAL-STAGED CONTEXT AGGREGATION METHOD TOWARDS EFFICIENT END-TO-END SPEECH ENHANCEMENT" [Demo]
Kai Zhen, Jongmo Sung, Mi Suk Lee, Seungkwon Beack, and Minje Kim, "Cascaded Cross-Module Residual Learning towards Lightweight End-to-End Speech Coding," In Proceedings of the Annual Conference of the International Speech Communication Association (INTERSPEECH'19), Graz, Austria, September 15-19, 2019.
[PDF] [BibTex] [Demo]
Kai Zhen, Aswin Sivaraman, Jongmo Sung, Minje Kim. On Psychoacoustically Weighted Cost
Functions Towards Resource-efficient Deep Neural Networks for Speech Denoising.
[PDF] [BibTex] [US Patent App. 16/122,708]
Learning graphical model (probabilistic inference in Bayesian network) through Latent Dirichlet Allocation, 2016 Spring. A Java version of naive LDA is also provided:)
PageRank algorithm in parallel via MPJ-Express open library, 2016 Fall.
I served as a reviewer of ICASSP 2019.
I served as a reviewer of EURASIP Journal on Advances in Signal Processing.
I served as a sub-reviewer of AAAI-2017, 2018.