Cluster Telemetry

Overview

Cluster Telemetry (CT) lets you run telemetry's benchmarks, Lua scripts, and other tasks with multiple repository patches against Alexa's top 1 million web pages. Developers can use the framework to measure the performance of their patch against this top subset of the internet on both Desktop and Android. tl;dr documentation is here.

SKP files are a binary format for the draw commands Chromium sends to Skia for rasterization. The project started with the goal of collecting a large repository of 10k SKP files. After incremental changes in approach, this repository has grown to ~910k SKP files, and the framework now supports running all telemetry benchmarks. The top-level feature request for this project was skia:1268.

A web application running on Google Compute Engine automates the process of capturing new archives and running telemetry benchmarks at the click of a button. Results are emailed to the requester, and the web application keeps a complete history of runs with links to results. You can run telemetry benchmarks at http://ct.skia.org.

The framework can also run Lua scripts on the SKP repository to scrape web pages. It takes only a few minutes to run a Lua scraping script on ~910k SKP files.

Most users will use these three features:

  • Chromium Perf. Documentation here. Webpage here.
  • Chromium Analysis. Documentation here. Webpage here.
  • Run Lua Scripts on SKP repositories. Documentation about Lua bindings is here. Webpage here.

Note: The top 1M web pages include potentially offensive content. Please use caution when visiting page links from the framework.

Framework Usage

The Chromium Perf page in CT has been used to gather perf data over the top 10k web pages for the following Chromium projects:

  • Slimming paint
  • Performance data for layer squashing and compositing overlap map
  • SkPaint in Graphics Context
  • Culling
  • New paint dictionary

blink-dev threads discussing how to make Chrome faster using the results gathered from CT:

Documents detailing data generated by the framework:

The framework has also been used to run multiple Lua scripts that scrape the SKP repositories for the following: chars-vs-glyphs, bitmap transform types, gradient color counter, 3 color gradient checks, etc. This has helped the Skia team determine which parts of the library to optimize and focus on.

All runs are recorded here.

System Architecture

System Diagram

Figure: CT System Diagram

Detailed explanation of steps

  1. User submits a Lua script task, a Performance task, an Analysis task, or an Admin task (build chrome, recreate pagesets, recreate webpage archives, capture SKPs) using the GCE web application here.

  2. Each task is exposed by the web application in JSON. The CT master polls the web application and picks up new tasks. It has the ability to run tasks in parallel.

  3. The master triggers swarming tasks using the master scripts here. The master scripts then check to see when the tasks are done.

  4. Swarming bots in the CT pool execute the task using the worker scripts here. All generated artifacts (CSV files, logs, SKP files, archives, etc.) are then copied to Google Storage.

  5. Once swarming tasks complete, master scripts read the generated artifacts from Google Storage and consolidate them (if required).

  6. The master scripts then email results of the task to the user who requested it. The scripts also update the status of the task to completed on the web application.
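
The master's part of the steps above (pick up tasks exposed as JSON, run them in parallel, record a final status) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the actual CT code: the Task fields, the JSON shape, and the names decodeTasks, runAll, and simulateTask are all hypothetical, and simulateTask merely stands in for triggering swarming tasks and consolidating artifacts from Google Storage.

```go
package main

import (
	"encoding/json"
	"fmt"
	"sort"
	"sync"
)

// Task is a hypothetical, simplified view of a pending task as the web
// application might expose it in JSON. The real CT schema is not shown here.
type Task struct {
	ID        int    `json:"id"`
	Type      string `json:"type"`      // e.g. "lua_script" or "chromium_perf" (assumed names)
	Requester string `json:"requester"` // address that would receive the results email
}

// decodeTasks parses a JSON array of tasks, as the master might receive it
// when polling the web application.
func decodeTasks(payload string) ([]Task, error) {
	var tasks []Task
	if err := json.Unmarshal([]byte(payload), &tasks); err != nil {
		return nil, err
	}
	return tasks, nil
}

// simulateTask is a stand-in for the real work: triggering swarming tasks,
// waiting for them, and consolidating artifacts. It fails on an empty type.
func simulateTask(t Task) error {
	if t.Type == "" {
		return fmt.Errorf("task %d has no type", t.ID)
	}
	return nil
}

// runAll runs every task with at most maxParallel running at once and
// returns the final status of each task, keyed by task ID.
func runAll(tasks []Task, maxParallel int, run func(Task) error) map[int]string {
	if maxParallel < 1 {
		maxParallel = 1
	}
	statuses := make(map[int]string, len(tasks))
	var mu sync.Mutex
	var wg sync.WaitGroup
	sem := make(chan struct{}, maxParallel) // counting semaphore

	for _, t := range tasks {
		wg.Add(1)
		sem <- struct{}{} // blocks while maxParallel tasks are in flight
		go func(t Task) {
			defer wg.Done()
			defer func() { <-sem }()
			status := "completed"
			if err := run(t); err != nil {
				status = "failed: " + err.Error()
			}
			mu.Lock()
			statuses[t.ID] = status
			mu.Unlock()
		}(t)
	}
	wg.Wait()
	return statuses
}

func main() {
	payload := `[{"id":1,"type":"lua_script","requester":"a@example.com"},
	             {"id":2,"type":"chromium_perf","requester":"b@example.com"}]`
	tasks, err := decodeTasks(payload)
	if err != nil {
		panic(err)
	}
	statuses := runAll(tasks, 2, simulateTask)

	// Print in a stable order; a real master would instead email the
	// requester and update the task status on the web application.
	ids := make([]int, 0, len(statuses))
	for id := range statuses {
		ids = append(ids, id)
	}
	sort.Ints(ids)
	for _, id := range ids {
		fmt.Printf("task %d: %s\n", id, statuses[id])
	}
}
```

The buffered channel caps how many tasks run concurrently, which is one simple way to model a master that can run tasks in parallel without starting an unbounded number of them.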

Code

Cluster Telemetry is primarily written in Go, with a few Python scripts. The framework lives in master/ct.

Contact Us

If you have questions, please email cluster-telemetry@chromium.org or contact rmistry@ directly.