Kaldi (software)

From Wikipedia, the free encyclopedia
Jump to navigation Jump to search
Kaldi
DevelopersDaniel Povey and others
Stable release
5.5.636 / February 2020; 6 years ago (2020-02)
Repositoryhttps://github.com/kaldi-asr/kaldi
Written inC++
Engine
    Lua error in Module:EditAtWikidata at line 29: attempt to index field 'wikibase' (a nil value).
    Operating systemUnix systems (Linux, BSD, OSX 10.{8,9} etc.), Windows (via Cygwin)
    TypeSpeech recognition
    LicenseApache License v.2.0[1]
    Websitekaldi-asr.org

    Kaldi is an open-source speech recognition toolkit written in C++ for speech recognition and signal processing, freely available under the Apache License v2.0.

    Kaldi aims to provide software that is flexible and extensible,[2] and is intended for use by automatic speech recognition (ASR) researchers for building a recognition system.

    It supports linear transforms, MMI, boosted MMI and MCE discriminative training, feature-space discriminative training, and deep neural networks.[3]

    Kaldi is capable of generating features like mfcc, fbank, fMLLR, etc. Hence in recent deep neural network research, a popular usage of Kaldi is to pre-process raw waveform into acoustic feature for end-to-end neural models.

    Kaldi has been incorporated as part of the CHiME Speech Separation and Recognition Challenge over several successive events.[4][5][6] The software was initially developed as part of a 2009 workshop at Johns Hopkins University.[7]

    Kaldi is named after the legendary Ethiopian goat herder Kaldi who was said to have discovered the coffee plant.[8]

    See also

    [edit | edit source]

    Lua error in mw.title.lua at line 392: bad argument #2 to 'title.new' (unrecognized namespace name 'Portal').

    References

    [edit | edit source]
    1. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
    2. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
    3. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
    4. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
    5. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
    6. ^ Emmanuel Vincent, Jon Barker, Shinji Watanabe, Jonathan Le Roux, Francesco Nesta, et al.. The second 'CHiME' Speech Separation and Recognition Challenge: Datasets, tasks and baselines. ICASSP - 38th International Conference on Acoustics, Speech, and Signal Processing - 2013, May 2013, Vancouver, Canada. pp.126-130, 2013.
    7. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
    8. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
    [edit | edit source]
    • Lua error in Module:Official_website at line 94: attempt to index field 'wikibase' (a nil value).
    • Kaldi – The official GitHub project
    • Kaldi paper - The Kaldi Speech Recognition Toolkit
    • VOSK – open source and commercial models from Alpha Cephei on Kaldi foundations