Weird.
So I had a strong suspicion that there was a useful correlation between line count and unique identifier count in source code. Specifically, that there would be some factor of the number of lines that would be a likely and reasonably tight upper bound for the number of unique identifiers. The distribution below it, as the number of samples grow, would look roughly normal.
And, seems true!
And the high probability upper bound? ** 1:1 **
Really didn't guess it would be exactly 1:1....
@chandlerc this aligns with my suspicion that we are all secretly forth programmers