Paper
6 April 2023 Research and application of end-to-end pig speech recognition model
Chenglong Du, Mingmin Gong, Wei Du, Jie Xie, ZhiXiang Gao
Proceedings Volume 12615, International Conference on Signal Processing and Communication Technology (SPCT 2022); 126151N (2023) https://doi.org/10.1117/12.2674249
Event: International Conference on Signal Processing and Communication Technology (SPCT 2022), 2022, Harbin, China
Abstract
In this project, a pig's location can be determined from its sound signal, so that the individual pig can be locked onto. Experienced keepers can then judge the cause of the vocalization from the sound of the located pig and assess its health status. This paper studies how to enrich the audio spectrum features extracted from the original voice file and how to minimize signal loss while generating those features, especially for the spectrum features that play a key role in downstream tasks. It proposes fusing CNN-based audio spectrum features with MFCC audio spectrum features computed at different frame shifts and frame lengths, and using the fused result as the feature representation that forms the input layer of the backbone model.
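To illustrate what MFCC features computed at different frame lengths and frame shifts might look like in practice, here is a minimal NumPy sketch. It is not the authors' implementation: the abstract does not say how the features are fused, so aligning the frame counts by linear interpolation and concatenating along the feature axis is an assumption. The function names, the 26 mel filters, the 13 coefficients, and the sample-based frame settings are also illustrative choices. The CNN-derived spectrum features mentioned in the abstract are left out.

```python
import numpy as np

def frame_signal(x, frame_len, frame_shift):
    """Split a 1-D signal into overlapping, Hamming-windowed frames."""
    n_frames = 1 + (len(x) - frame_len) // frame_shift
    idx = np.arange(frame_len)[None, :] + frame_shift * np.arange(n_frames)[:, None]
    return x[idx] * np.hamming(frame_len)

def mel_filterbank(n_mels, n_fft, sr):
    """Triangular mel filters with shape (n_mels, n_fft // 2 + 1)."""
    hz_to_mel = lambda f: 2595.0 * np.log10(1.0 + f / 700.0)
    mel_to_hz = lambda m: 700.0 * (10.0 ** (m / 2595.0) - 1.0)
    hz_pts = mel_to_hz(np.linspace(hz_to_mel(0.0), hz_to_mel(sr / 2.0), n_mels + 2))
    freqs = np.linspace(0.0, sr / 2.0, n_fft // 2 + 1)
    fb = np.zeros((n_mels, freqs.size))
    for m in range(n_mels):
        lo, mid, hi = hz_pts[m:m + 3]
        rising = (freqs - lo) / (mid - lo)
        falling = (hi - freqs) / (hi - mid)
        fb[m] = np.maximum(0.0, np.minimum(rising, falling))
    return fb

def mfcc(x, sr, frame_len, frame_shift, n_mels=26, n_ceps=13):
    """MFCC matrix of shape (n_frames, n_ceps) for one frame setting."""
    frames = frame_signal(x, frame_len, frame_shift)
    power = np.abs(np.fft.rfft(frames, n=frame_len, axis=1)) ** 2
    log_mel = np.log(power @ mel_filterbank(n_mels, frame_len, sr).T + 1e-10)
    # Unnormalized DCT-II over the mel axis, keeping the first n_ceps terms.
    k = np.arange(n_ceps)[:, None]
    n = np.arange(n_mels)[None, :]
    dct_basis = np.cos(np.pi * k * (2 * n + 1) / (2 * n_mels))
    return log_mel @ dct_basis.T

def resample_frames(feat, n_target):
    """Linearly interpolate a (frames, dims) matrix to n_target frames."""
    src = np.linspace(0.0, 1.0, feat.shape[0])
    dst = np.linspace(0.0, 1.0, n_target)
    cols = [np.interp(dst, src, feat[:, d]) for d in range(feat.shape[1])]
    return np.stack(cols, axis=1)

def fuse_multi_resolution_mfcc(x, sr, configs):
    """Concatenate MFCCs from several (frame_len, frame_shift) settings.

    Assumption: streams are aligned to the shortest frame count and
    concatenated feature-wise; the paper may fuse them differently.
    """
    feats = [mfcc(x, sr, flen, fshift) for flen, fshift in configs]
    n_target = min(f.shape[0] for f in feats)
    return np.concatenate([resample_frames(f, n_target) for f in feats], axis=1)

# Usage: one second of synthetic noise at 16 kHz, standing in for a recording.
sr = 16000
signal = np.random.default_rng(0).standard_normal(sr)
# 25 ms / 10 ms and 50 ms / 20 ms frames, given in samples.
fused = fuse_multi_resolution_mfcc(signal, sr, [(400, 160), (800, 320)])
print(fused.shape)
```

Short frames track rapid changes in a call, while longer frames resolve frequency more finely, so stacking both gives a downstream model access to each view at every time step.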
© (2023) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Chenglong Du, Mingmin Gong, Wei Du, Jie Xie, and ZhiXiang Gao "Research and application of end-to-end pig speech recognition model", Proc. SPIE 12615, International Conference on Signal Processing and Communication Technology (SPCT 2022), 126151N (6 April 2023); https://doi.org/10.1117/12.2674249
KEYWORDS
Data modeling
Feature extraction
Fourier transforms
Animals
Windows
Animal model studies
Speech recognition