Number of characters typed before success

The number of characters typed in each session is a proxy for the amount of effort a user must exert to find the item they are looking for. The 95% CI mostly overlaps, suggesting the tuned ranking is not an improvement.

Abandonment Rate

The ratio of page loads with start events against the page loads with click events is interepreted loosely as the abandonment rate of search. The 95% CI completely overlaps, suggesting the test treatment had no effect on abandonment rates. While this is consistent with previous tests, we don't know why so many sessions start but don't complete the entity selection.

Click Position

The position of clicked result is another proxy for the amount of effort a user must exert to find the item they are looking for. The 95% CI overlaps, suggesting the tuning is not an improvement to click position.

Looking into this result closer there is a statistically significant derease in Clicks@1, suggesting the tuning is not an improvement.