Utilize este identificador para referenciar este registo:
https://hdl.handle.net/1822/2054
Registo completo
Campo DC | Valor | Idioma |
---|---|---|
dc.contributor.author | Lima, C. S. | - |
dc.contributor.author | Oliveira, Jorge F. | - |
dc.date.accessioned | 2005-06-08T21:42:48Z | - |
dc.date.available | 2005-06-08T21:42:48Z | - |
dc.date.issued | 2003-12 | - |
dc.identifier.citation | INTERNATIONAL WORKSHOP ON MODELS AND ANALYSIS OF VOCAL EMISSIONS FOR BIOMEDICAL APPLICATIONS (MAVEBA), 3, Firenze, 2003. | eng |
dc.identifier.uri | https://hdl.handle.net/1822/2054 | - |
dc.description.abstract | The changing on peaks structure of the speech spectrum is perhaps the most important cause of degradation of speech recognition systems under adverse conditions. Another drawback concerned to the additive noise effect occurs on the flat spectral zones which are usually raised proportionally to the noise level. These combined effects on both the peaked and the flat spectral zones can be alleviated by trying to restore its original structure, which assumes noise knowledge. However, the random nature and the variability of the noise, the difficulty in discriminating speech pauses, among others, discourage the use of noise estimates as the basis of robust speech recognition algorithms. Alternative approaches based on normalisation procedures become very promising since the noise effect can be alleviated without any knowledge regarding to its existence. This paper suggests a spectral normalisation that though being different can be viewed as a noise estimation procedure in a frame by frame basis, so assuming the clean database as lightly corrupted. This speech normalisation is used to restore the normalised speech spectrum. This normalised spectrum is then re-normalised by a baseline spectrum normalisation method, which concentrates essentially in the speech regions of small energy, since in these regions the noise is more dominant, so they require a better degree of robustness. | eng |
dc.language.iso | eng | eng |
dc.rights | openAccess | eng |
dc.subject | Features robustness | eng |
dc.subject | Feature adaptation | eng |
dc.subject | Robust speech recognition | eng |
dc.title | Spectral bi-normalisation for speech recognition in additive noise | eng |
dc.type | conferencePaper | eng |
dc.peerreviewed | yes | eng |
Aparece nas coleções: | DEI - Artigos em atas de congressos internacionais |
Ficheiros deste registo:
Ficheiro | Descrição | Tamanho | Formato | |
---|---|---|---|---|
maveba3.pdf | 174,33 kB | Adobe PDF | Ver/Abrir |