HTK
 [ Home | Register ]
Home

Getting HTK
    Register at CUED

Documentation            
    FAQ
    History of HTK
    CUED LVR Systems
    License

Development
    Get involved
    Future Plans


HTK3-website

HTK FAQ

License Issues

  • What is the role of Microsoft in all this?

    See the 'History of HTK' page on the website for a full explanation. The short version is:

    HTK was originally developed at the Cambridge University Engineering Department (CUED). In 1993 Entropic Research Laboratory Inc. acquired the rights to sell HTK and the development of HTK was fully transferred to Entropic in 1995 when the Entropic Cambridge Research Laboratory Ltd was established. HTK was sold by Entropic until 1999 when Microsoft bought Entropic. Microsoft has now licensed HTK back to CUED and is providing support so that CUED can redistribute HTK and provide development support via the HTK3 web site. While Microsoft retains the copyright to the existing HTK code, everybody is encouraged to make changes to the source code and contribute them for inclusion in HTK3.

  • Can I build & sell products based on HTK3?

    You may build a product but you are not allowed to redistribute (parts of) HTK3, i.e. you can't ship shrink-wrap boxes with products that contain HTK3 code.

  • Can I use this version for teaching & academic research?

    Yes, absolutely.

  • Can I use this version in commercial product Research & Development?

    Yes. You can for example use HTK3 to train models that are then used in your products.

General

  • What is HTK and what is it not?

    HTK is a toolkit for research in automatic speech recognition and has been used in many commercial and academic research groups for many years.

    It is not a desktop dictation system or an application that you just install on your PC to get a speech interface to your computer.

  • Are there any teaching materials available that use HTK3?

    Not yet, but we always welcome contributions.

  • I have a problem using HTK3 -- where can I get support?

    A set of mailing lists have been created to help solve HTK related problems (see Mailing List page). Queries about the use of HTK3 should be addressed to htk-users@eng.cam.ac.uk.

Source Code

  • What is the difference between this version and the Entropic one?

    The HTK3 code is based on the final release from Entropic (V2.2_ref). The source code now contains Microsoft Copyright headers; there is revised and consistent indentation and some minor changes to remove some compiler warnings have been made. Functionally the code in HTK 3.0.0 should be identical to HTK V2.2_ref.

  • Can I build the CU-HTK Hub4/5 evaluation system with HTK3?

    No. The HTK version used internally at the Cambridge University Engineering Department has some extra functionality not available in HTK3; such as PLP coefficients; vocal tract length normalisation; extended types of adaptation; lattice operations; support for context dependency beyond triphones; long-span language model support etc. Some of the most generally useful of these features will be made available in HTK3 over the next few months. In addition to this HTK3 does not include a large vocabulary decoder that is usable with cross-word triphones and long-span language models at the moment (HVite can rescore but not generate lattices for such configurations). We hope to be able to provide a capable large vocabulary decoder in the medium term.

  • What kind of computer/OS do I need to run HTK3?

    The distributed version of HTK3 should build on Solaris, IRIX, HPUX and Linux. If it doesn't, please file a bug report. HTK has also been successfully built on Windows NT.

    HTK has always endeavoured to be as machine independent as possible and it should be possible to build all of the main tools on any machine supporting ANSI C and either X-Windows or MS-Windows. The only exception to this concerns direct audio recording/replay. If this is needed, an appropriate driver may need to be written.

  • How can I use HTK with my own data in text/matlab/binary format?

    You need to convert your data to the HTK parameter format (you can eiter use the USER parameter kind or pretend they are MFCC or similar). Typically this involves writing a small programm that does the conversion. For an example see HTKWrite.c

Commonly used abbreviations:

CUED - Cambridge University Engineering Department
HTK - Hidden Markov Model Toolkit
PLP - Perceptual Linear Prediction

Comments and suggestions to htk-mgr@eng.cam.ac.uk