Media Local Learning Framework

This directory contains code to support media‘s local learning experiment. It provides lightweight learning algorithms that can be trained on the user’s local device to tailor media performance / behavior to match the user's usage.

Terms

Feature vector - How we describe the “state of the world” to the learner. For example, we might describe a video using the features {width, height, format, frame rate}.

Target - The output value we'd like to predict, given some features. We might want to know a boolean representing “will playback be smooth?”.

Training example - A tuple {Feature vector, Target value} that demonstrates the desired target for some feature vector. The learning algorithm collects examples, and tries to generalize them to unseen features.

Classification - Class of problems for which the target value is nominal. For example, predicting the expected color from a set of five colors is a classification task. The key idea is that the target values are unordered.

Regression - Class of problems for which the target value is numeric. For example, predicting how tall a plant will grow is regression.

Model - A class of functions that relates features (inputs) to target values (outputs). For example, a linear model relates them as:

target = weight1 * feature1 + weight2 * feature2 + ...

Note that the weights aren‘t known in advance; we’ll choose them as part of the training process based on the training examples.

Model parameters - The missing values in our model that the learning learning algorithm tries to figure out based on the training data. In our linear model, we'd need to know weight1 and weight2.

Learning task - A problem we‘re trying to solve. For example, "Will this video element be played before it’s destroyed?"

Classes

There are several classes that we define here. While more detail can generally be found in the header for the class, an overview of the main ones is:

Learner - Base class for a thing that knows how to convert training data into a fully trained model (model + parameters). For example, we might have a Learner subclass that chooses the parameters for a Naive Bayes model. Similarly, we might have a Learner that trains a linear regression model.

LearningTask - Description of a task, and also, because it's convenient, a choice of model that will be used to learn it. It contains:

* name
* description of features (name, nominal vs numeric, etc.)
* description of the target value
* description / parameters of the learning model to be used

Instance - Set of feature values.

Value - Representation of a number or (hashed) string.

Models

All of our models are supervised.

Directory Structure

common/ - public interfaces
impl/ - learning algorithms, other implementation details
mojo/ - mojo service-side implementations
mojo/public/cpp - public headers / clients for mojo
mojo/public/mojom - public mojom interfaces