Ok so quick question (don't know how well it translates from example.)
Your sampling every post not a normalized list of identical amount of replies?
For instance
Post A has 8 replies on tiny server
Post B has 1 replies on large server
is different info than two 8 reply posts, no?