Thanks for investigating this and for the writeup!
Did you happen to capture differences in binary size, too? After compile time, the biggest downside of monomorphization is binary bloat (roughly 30%, going by estimates from MLton). Even if this work does not decrease compile times, taking even 5% off our code size could have a significant impact on Servo’s performance, particularly on embedded hardware with relatively small i-caches.