Kernel eigenvoice

Speaker adaptation is an important technology to fine-tune either features or speech models for mis-match due to inter-speaker variation. In the last decade, eigenvoice (EV) speaker adaptation has been developed. It makes use of the prior knowledge of training speakers to provide a fast adaptation algorithm (in other words, only a small amount of adaptation data is needed). Inspired by the kernel eigenface idea in face recognition, kernel eigenvoice (KEV) is proposed.^[1] KEV is a non-linear generalization to EV. This incorporates Kernel principal component analysis, a non-linear version of Principal Component Analysis, to capture higher order correlations in order to further explore the speaker space and enhance recognition performance.

References

^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).

External links

Kernel Eigenvoice Speaker Adaptation Archived 2012-03-12 at the Wayback Machine, ScientificCommons
Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
Speedup of Kernel Eigenvoice Speaker Adaptation by Embedded Kernel PCA, ICSLP 2004.
Speaker Adaptation via Composite Kernel PCA, NIPS 2003.
Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).

[1] Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).

[1]

Kernel eigenvoice

See also

References

External links

Navigation menu

Kernel eigenvoice

See also

References

External links

Navigation menu

Search