@mhoye Found it. Patrick Juola was the guy. 1998, categorizing the complexity of languages by gzipping them. scholar.google.co.uk/scholar?q