ISIPLab

This is Intelligent Signal and Image Processing Lab at the Computer Science and Technogy Institute of Anhui University. Our research focuses on providing the state of the art technologies and exploring theories for modern signal and image processing. Specifically, we have proposed a series of fast real-value discrete Gabor transform theories and/or algorithms to efficiently perform signal transformation, the aim of which is to obtain a high resolution spectrum simultaneously in time and frequency domain. These technologies have been widely used in speech processing, time-frequency signal analysis, and image compression, et al.

In addition, we are interested in speech and image processing for the next generation of human-computer interface as well as for improving human health via intelligent signal processing methodology. For example, we have proposed efficient technologies for improving the communication quality of patients of laryngocarcinoma via transforming the whisper-like speech to the normal one. For improving the communication in very noisy environment, we provide the receiver a normal voiced speech which is obtained from the bone-conducted microphone, which can be widely used in noisy factory and military scenes. We also pay much attention to vein and palmprint recognition technology which has been successfully used in industry areas.

Currently, we also deliver part of our attention to physiological signal processing such as EEG for emotional conversion or recognition and brain PET signal for Alzheimer’s disease prediction.

news

Jan 21, 2023	Our Paper “A Novel Attention-Guided Generative Adversarial Network for Whisper-to-Normal Speech Conversion” was accepted by Cognitive Computation.

latest posts

May 20, 2025	Demo page for "CETTS"
May 8, 2025	Demo page for "MEDAF"

selected publications

Multi-scale 3D-CRU for EEG emotion recognition

Hao Dong, Jian Zhou, Cunhang Fan, and 3 more authors

Biomedical Physics & Engineering Express, 2024

Bib HTML

@article{dh2024,
  author = {Dong, Hao and Zhou, Jian and Fan, Cunhang and Zheng, Wenming and Tao, Liang and Kwan, Hon Keung},
  year = {2024},
  title = {Multi-scale 3D-CRU for EEG emotion recognition},
  volume = {10},
  journal = {Biomedical Physics & Engineering Express},
  doi = {10.1088/2057-1976/ad43f1},
}

Dynamic Ensemble Teacher-Student Distillation Framework for Light-weight Fake Audio Detection

Jun Xue, Cunhang Fan, Jiangyan Yi, and 2 more authors

IEEE Signal Processing Letters, 2024

Bib HTML

@article{20240805,
  author = {Xue, Jun and Fan, Cunhang and Yi, Jiangyan and Zhou, Jian and Lv, Zhao},
  year = {2024},
  title = {Dynamic Ensemble Teacher-Student Distillation Framework for Light-weight Fake Audio Detection},
  volume = {},
  journal = {IEEE Signal Processing Letters},
  doi = {10.1109/LSP.2024.3431936},
}

Multi-Level Information Aggregation Based Graph Attention Networks Towards Fake Speech Detection

Jian Zhou, Yong Li, Cunhang Fan, and 2 more authors

IEEE Signal Processing Letters, 2024

Bib HTML

@article{ly2024,
  author = {Zhou, Jian and Li, Yong and Fan, Cunhang and Tao, Liang and Kwan, Hon Keung},
  year = {2024},
  title = {Multi-Level Information Aggregation Based Graph Attention Networks Towards Fake Speech Detection},
  volume = {31},
  journal = {IEEE Signal Processing Letters},
  doi = {10.1109/LSP.2024.3408676},
}

A Novel Attention-Guided Generative Adversarial Network for Whisper-to-Normal Speech Conversion

Teng Gao, Qing Pan, Jian Zhou, and 3 more authors

Cognitive Computation, Jan 2023

Bib HTML

@article{20230115,
  author = {Gao, Teng and Pan, Qing and Zhou, Jian and Wang, Huabin and Tao, Liang and Kwan, Hon},
  year = {2023},
  month = jan,
  title = {A Novel Attention-Guided Generative Adversarial Network for Whisper-to-Normal Speech Conversion},
  volume = {15},
  journal = {Cognitive Computation},
  doi = {10.1007/s12559-023-10108-9},
}

SETransformer: Speech Enhancement Transformer

Weiwei Yu, Jian Zhou, HuaBin Wang, and 1 more author

Cognitive Computation, May 2022

Bib HTML

@article{20220514001,
  author = {Yu, Weiwei and Zhou, Jian and Wang, HuaBin and Tao, Liang},
  year = {2022},
  month = may,
  title = {SETransformer: Speech Enhancement Transformer},
  volume = {14},
  journal = {Cognitive Computation},
  doi = {10.1007/s12559-020-09817-2},
}

Multistage Model for Robust Face Alignment Using Deep Neural Networks

Huabin Wang, Rui Cheng, Jian Zhou, and 2 more authors

Cognitive Computation, May 2022

Bib HTML

@article{20220514,
  author = {Wang, Huabin and Cheng, Rui and Zhou, Jian and Tao, Liang and Kwan, Hon},
  year = {2022},
  month = may,
  title = {Multistage Model for Robust Face Alignment Using Deep Neural Networks},
  volume = {14},
  journal = {Cognitive Computation},
  doi = {10.1007/s12559-021-09846-5},
}