Add end-to-end tests to verify that we don't break anything. (They will not check the quality of the scores.)
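A minimal sketch of what such a test could look like, assuming the pipeline takes a list of inputs and returns one numeric score per input. The names `run_pipeline` and `check_end_to_end` are hypothetical placeholders, not part of the existing code. The test only checks that the pipeline runs and returns well-formed scores, and makes no assertion about their quality.

```python
import math


def run_pipeline(texts):
    """Hypothetical stand-in for the real pipeline (assumption):
    returns one float score per input."""
    return [len(t) / (len(t) + 1.0) for t in texts]


def check_end_to_end(pipeline, inputs):
    """Smoke check: the pipeline runs and returns well-formed scores.

    Deliberately makes no assertion about how good the scores are.
    """
    scores = pipeline(inputs)
    assert len(scores) == len(inputs), "expected one score per input"
    for s in scores:
        assert isinstance(s, float), "scores should be floats"
        assert math.isfinite(s), "scores should not be NaN or inf"
    return scores


def test_pipeline_end_to_end():
    # Inputs are arbitrary; an empty string is included as an edge case.
    check_end_to_end(run_pipeline, ["a short example", "another input", ""])
```

With a runner such as pytest, `test_pipeline_end_to_end` would be collected automatically. Replacing `run_pipeline` with the real entry point is the only change needed, provided it follows the assumed input and output shape.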