Conference Proceedings
- Home
- A/B user testing when your CDN cache messes with your results
A/B user testing when your CDN cache messes with your results
Description
Typical A/B user testing changes client-side behavior between users and compares metrics to evaluate the impact of the change. When applying this principle to video (pipelines/containers/codecs/…) changes, things become more complicated. As the the user selects a version of an asset based on the bucket it is part of, the asset needs to be processed for each bucket and will propagate independently through your CDN. Transient CDN effects, like caching, make that the results in this scenario not reflect the real-life behavior anymore. Suddenly, for a single ingest, the number of cache misses doubles, which negatively impacts the start up time, watch times, favoriting, likes etc. This effect is less outspoken if your CDN cache is not impacting your performance (e.g. you’re Netflix and have high intent and large video buffers). But when you rely on loading video quickly, client-side A/B testing messes with your results.
In our Media Centric Experiment approach, assets are bucketed at ingest. With a large number of videos, differences in performance can be attributed to the treatment of both groups. Unfortunately, you can’t just take the ‘average’ of a metric and assume things improve. We’ll discuss the statistical processing (bootstrapping and a Z-test) we apply to confidently make conclusions on the performance of an experiment and make a decision to ship.
In this talk, we’ll dive into the issue for A/B testing with video, how we set up our infrastructure to run Media Centric Experiment and explain (in laymen terms) the statistical procedure we follow.
Presented at Demuxed 2019 in San Francisco.
Conference
Speakers
Other Proceedings
Here are some other proceedings that you might find interesting.
What Codec Should I Use?
Alan Resnick
Doing Server-Side Ad Insertion on Live Sports for 25.3M Concurrent Users
Ashutosh Agrawal
Is now the time to solve the deepfake threat?
Roderick Hodgson
Super Resolution: The scaler of tomorrow, here today!
Nick Chadwick
The do's and don'ts about Streaming security
Javier Brines Garcia
Modeling the conceptual structure of FFmpeg in JavaScript
Ryan Harvey
Objectionable Uses of Objective Quality Metrics
Richard Fliam
RTMP: web video innovation or Web 1.0 hack… how did we get to now?
Sarah Allen
Large-Scale Media Archive Migration to the Cloud
Konstantin Wilms
HEVC Upload Experiments
Chris Ellsworth
Related Courses
Below are some courses that might interest you based on the learning categories and topic tags of this conference proceeding.
What Codec Should I Use?
Alan Resnick
Doing Server-Side Ad Insertion on Live Sports for 25.3M Concurrent Users
Ashutosh Agrawal
Is now the time to solve the deepfake threat?
Roderick Hodgson
Super Resolution: The scaler of tomorrow, here today!
Nick Chadwick
The do's and don'ts about Streaming security
Javier Brines Garcia
Modeling the conceptual structure of FFmpeg in JavaScript
Ryan Harvey
Objectionable Uses of Objective Quality Metrics
Richard Fliam
RTMP: web video innovation or Web 1.0 hack… how did we get to now?