Game-theoretic model:
Here, we are trying to provide the model for the coercing interaction. We make few simplifications: first, the vote utility of the seller is known to the buyer. Second, buyer has some prior assumption on the distribution of keychain height, we assume it to be uniformly random (for example, because this is our standard setup algorithm).
Utilities of the vote are denoted and .
Assume the following game with two parties:
Seller runs the setup phase protocol, outputting uniformly random number .
Buyer sets the price list: function
Seller outputs the claimed height .
If it outputs , it gets , and buyer gets , if it outputs , it gets , and buyer gets .
First, it is easy to see whether seller defects on 3rd step or not. Clearly, if , it is profitable to defect by outputting with maximal price, and otherwise, it is not profitable to defect.
Because we assume that the buyer has knowledge of , it can predict the optimal behavior of seller on the third step, lets optimize the price list.
Let's call the value "bad" if , and good otherwise. There is good values. The prices in good values monotonously increase, with step at least . Bad values are never picked correctly, so their price can be safely set to wlog.
Assuming we optimize for good values, we can ensure that they grow exactly by increments (which minimizes expected bribe over them), and ensure that expected bribe over bad values is (by setting their price to and moving them to the left of all good values): so, the optimal price list looks like .
This gives the average bribe spending with votes being purchased. This gives the cost of one vote .
Denote the advantage (if the buyer is going to buy votes it must have greater utility for them; practically speaking this advantage can be quite big if buyer is a resourceful party and seller is a small voter).
The buyer is willing to purchase votes at cost at most . Solving we get .
This means that at most votes will be purchased in this model.
Should we analyse it without prior distribution?
It is unclear for us what are the correct assumptions without buyer knowing the prior distribution of - probably that of a repeated game. However, in practice the process of generating the key will eventually be bound to the modified unconditional private key possession proof (see here Vitalik's description of such meatspace protocol for normal private keys), which fixes the distribution.
Can we do better with different distribution?
It is not exactly clear, but likely yes: while using non-uniform distribution on means that the buyer can get an advantage by choosing a subset of size with probability , this advantage likely will be offset by defections (for example, for slowly decreasing probability of obtaining ).
We did not work through details; nor did we consider more complex negotiation strategies.