Spectral bi-normalisation for speech recognition in additive noise

Utilize este identificador para referenciar este registo: https://hdl.handle.net/1822/2054

Registo completo

Campo DC	Valor	Idioma
dc.contributor.author	Lima, C. S.	-
dc.contributor.author	Oliveira, Jorge F.	-
dc.date.accessioned	2005-06-08T21:42:48Z	-
dc.date.available	2005-06-08T21:42:48Z	-
dc.date.issued	2003-12	-
dc.identifier.citation	INTERNATIONAL WORKSHOP ON MODELS AND ANALYSIS OF VOCAL EMISSIONS FOR BIOMEDICAL APPLICATIONS (MAVEBA), 3, Firenze, 2003.	eng
dc.identifier.uri	https://hdl.handle.net/1822/2054	-
dc.description.abstract	The changing on peaks structure of the speech spectrum is perhaps the most important cause of degradation of speech recognition systems under adverse conditions. Another drawback concerned to the additive noise effect occurs on the flat spectral zones which are usually raised proportionally to the noise level. These combined effects on both the peaked and the flat spectral zones can be alleviated by trying to restore its original structure, which assumes noise knowledge. However, the random nature and the variability of the noise, the difficulty in discriminating speech pauses, among others, discourage the use of noise estimates as the basis of robust speech recognition algorithms. Alternative approaches based on normalisation procedures become very promising since the noise effect can be alleviated without any knowledge regarding to its existence. This paper suggests a spectral normalisation that though being different can be viewed as a noise estimation procedure in a frame by frame basis, so assuming the clean database as lightly corrupted. This speech normalisation is used to restore the normalised speech spectrum. This normalised spectrum is then re-normalised by a baseline spectrum normalisation method, which concentrates essentially in the speech regions of small energy, since in these regions the noise is more dominant, so they require a better degree of robustness.	eng
dc.language.iso	eng	eng
dc.rights	openAccess	eng
dc.subject	Features robustness	eng
dc.subject	Feature adaptation	eng
dc.subject	Robust speech recognition	eng
dc.title	Spectral bi-normalisation for speech recognition in additive noise	eng
dc.type	conferencePaper	eng
dc.peerreviewed	yes	eng
Aparece nas coleções:	DEI - Artigos em atas de congressos internacionais

Ficheiros deste registo:

Ficheiro	Descrição	Tamanho	Formato
maveba3.pdf		174,33 kB	Adobe PDF	Ver/Abrir

Ver registo simples Sugerir correção Estatísticas