We use Plackett-Luce probabilistic model + UCB-E exposure control. Key points:
Eliminated models enter Probation period, LCB (Lower Confidence Bound) determines revival, not "revive with one win"
This platform has no corporate backing
No AI model provider intervention allowed
Driving better AI development through real user evaluation data