27th International Conference on Machine Learning (ICML), Haifa 2010

OpenKernel

author: Cyril Allauzen, Research at Google, Google, Inc.
published: July 20, 2010, recorded: June 2010, views: 4729

Slides

Report a problem or upload files

If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.
Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.

Lecture popularity: You need to login to cast your vote.

Description

The OpenKernel library is an open-source software library for designing, combining, learning and using kernels for machine learning applications

The library supports the design and use of kernels defined over dense and sparse real vectors, as well as over sequences or distributions of sequences.

For dense and sparse features, the library provides implementation of the classical kernels: linear, polynomial, Gaussian and sigmoid.

For sequences and distributions of sequences, the library implements the rational kernel framework of Cortes et al. (JMLR, 2004). The library supplies the following sequence kernels:

n -gram kernels, gappy n-gram kernels, mismatch kernels (Leslie et al., 2004), and gives the utilities for creating arbitrary rational kernels simply by providing the weighted finite-state transducers they are based on.

Kernels can be combined by taking their sum or their product, and can be composed with a polynomial, a Gaussian or a sigmoid. They support on-demand evaluation and caching. In addition to its own binary format, the library uses the ASCII format of LIBSVM/LIBLINEAR/SVMlight for representing features (and precomputed kernels for LIBSVM).

Finally, the OpenKernel library also includes several options for using training data to automatically combine multiple kernels. This is particularly useful when the single best kernel for the task is not known. The algorithms implemented include

L1-regularized linear combinations (Lanckriet et al. JMLR 2004); L2-regularized linear combinations (Cortes et al. UAI 2009); L2-regularized quadratic combinations (Cortes et al. NIPS 2009), as well as kernel correlation, or alignment (Cortes et al. ICML 2010), based combinations. Specialized efficient versions of these algorithms are also made available for weighting features and sparseness and can be used to further improve efficiency. The output kernels can be easily used in conjunction with LIBSVM, SVMlight and included kernel ridge regression implementations. Full reference documentation, tutorials and examples (with formatted datasets) are included.

The library is an open-source project distributed under the Apache license (2.0). This work has been partially supported by Google Inc. The library uses the OpenFst library for representing and manipulating weighted finite-state transducers.

Link this page

Would you like to put a link to this lecture on your homepage?
Go ahead! Copy the HTML snippet !

Write your own review or comment:

Comment:
Name:
Email address:
URL:

make sure you have javascript enabled or clear this field: