← Back to Arena

Verification Protocol

Verification Protocol & Algorithm Transparency

Part I — Auto-Router Engine

Each question selects 4 models from different providers for blind testing. Selection uses intent classification + four-slot forced exploration:

1.1 Intent Classifier

Lightweight regex + keyword classifier categorizes questions into:

DomainTrigger
codecode, programming, bug, API, function, implementation
mathcalculation, equation, proof, integral, probability
creativestory, poetry, creative, continuation, novel
factualwhat is, explain, history, science, principle
reasoninganalysis, comparison, logic, reasoning, argument

1.2 Four-Slot Stratified Selection

4 slots each serve a purpose, balancing exploration vs exploitation:

SlotStrategyRole
Slot 1Strongest BaselineCurrent P-L #1, ensures baseline quality
Slot 2UCB Dynamic ChallengeUCB selects potential models to challenge the leader
Slot 3Observing Gray-zoneRandomly pick from models with impressions < threshold, accumulate data
Slot 430% Upset30% chance from bottom tier, preventing filter bubbles

Slot 2 UCB-E score formula:

UCB(m) = γm + c × √( ln(N) / nm )

Where γm is the Plackett-Luce inferred model strength, N is total battles, nm is model m's battle count, c is the exploration constant.


Part II — Totoro Cross-Validation Engine

After voting, users can trigger cross-validation. Totoro refines the 4 model responses through factual distillation, producing an ultimate truth-seeking answer. Core algorithm: Four-Dimensional Weighted Consensus:

FinalScore(claim) = Wrank × Wconsensus × Wuser × Wevidence

2.1 Ranking Baseline Weight Wrank

Each model's information is weighted by its P-L rank via Sigmoid mapping:

Wrank(m) = 0.5 + σ(scorem - 1000) / 200) × 1.5

Mapped to [0.5, 2.0]. Rank #1 ≈ 1.5, bottom ≈ 0.8.

2.2 Consensus Mention Rate Wconsensus

If a core data point / logic block is independently mentioned by 3+ models:

Wconsensus = 1.0 + 0.5 × (overlap_count - 2),   if overlap ≥ 3
Wconsensus = 1.0,   otherwise

2.3 User Verification Boost Wuser

Weight modifiers based on user votes and DPO labels:

TagEffectModifier
Factually rigorous / no hallucinationWinner unique info boosted× 2.0
Code / format zero errorsWinner code blocks boosted× 2.0
Extremely strong logicWinner logic chain boosted× 1.5
Excellent instruction followingWinner structure boosted× 1.5
Severe factual hallucinationLoser info circuit-breaker× 0.0
Over-aligned / verboseLoser downweighted× 0.3
Logic break / infinite loopLoser downweighted× 0.2
Format crash / broken codeLoser downweighted× 0.3

2.4 Evidence Parameter Check Wevidence

Information with exact values, API params, or perfectly aligned with external search context gets highest priority; vague claims and uncited theories are forcibly removed.


Part III — Proof of Truth (Hash Attestation)

Each cross-validation output includes an immutable verification trace log:

VERIFICATION PROOF STRUCTURE

proof_hash = SHA256(
  battle_id +
  question +
  M1..M4 responses +
  weights_applied +
  user_signal +
  refined_answer +
  timestamp
)

e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855

Trace log includes:

Hash generated in real-time by backend, displayed as collapsible log on frontend, ensuring every answer is traceable.


Part IV — Plackett-Luce Ranking System

Arena ranking based on Plackett-Luce model + UCB-E anti-cheat strategy:

P(model i wins) = γi / Σγj

EM algorithm iteratively estimates γ parameter, combined with UCB-E (Upper Confidence Bound - Explorer) strategy balancing exploration vs exploitation.


Part V — Axiom V-Verification System

Axiom V is LLMVECT's full-stack anti-cheat and vote credibility engine, composed of four modules forming an unbypassable security pipeline:

5.1 Security Pipeline

Each vote must pass 7 layers of protection before counting toward rankings:

#LayerRejection CodeThreshold
1IP Rate Limiting42910 req / 60s
2Duplicate Vote Detection409Same user_id + battle_id
3Cooldown Period4255s cooldown
4Daily Cap429Blind 5 votes / Think 1 vote
5Device Fingerprint Check403device_hash blacklist
6Anomaly Pattern DetectionDownweightHigh-frequency / extreme bias
7Dual-Track Quota429Blind + Think counted independently

5.2 Full-Stack Device Fingerprinting

Each voting device generates a unique device_hash via triple hardware fingerprinting:

Fdevice = h(Canvastexels || WebGLvendor+renderer || AudioContextsample+fft || Systemscreen+tz+cores+mem)

device_hash uses daily salt (same device produces stable hash within 24h), ensuring tracking continuity. Constraints:

ConstraintValueViolation
Max user_ids per device3Registration rejected (403)
Daily vote cap per device30Device banned (403)
Banned device blacklistPermanentAll linked user_ids circuit-broken

5.3 Dynamic Difficulty Factor (D-Factor)

Each battle's ELO K-value is dynamically adjusted by vote consensus rate — contested battles get higher weight, landslide battles get minimal weight:

Kdynamic = Kbase × D-Factor,    Kbase = 32

D-Factor mapping function:

Consensus RateD-FactorSemantics
≥ 1.00 (unanimous)0.2 / Landslide, minimal ranking impact
0.751.67Majority agree, moderate impact
0.50 (split)2.33Even split, high-weight decisive
≤ 0.25 (contentious)3.0Fierce contest, maximum weight impact

Consensus rate = leader votes / total votes, updated in real-time after each round, ensuring subsequent voters face ELO params adapted to current contention.

5.4 Dual-Track Quota

Blind and Think modes are counted independently, no cross-consumption:

ModeDaily QuotaRevealELO Eligible
Blind5 / dayRevealed after voting✅ Yes
Think1 / dayReal-time visible reasoning✅ Yes

When Think Mode is on, users see the full reasoning chain, but limited to 1 per day to prevent over-reliance on a single model's reasoning.

AXIOM V INTEGRITY GUARANTEE

Each vote record carries: device_hash | is_think_enabled | task_difficulty_factor
Each battle carries: total_votes | consensus_rate | difficulty_factor
All fields auditable, SHA-256 attested as V-Verification Hash Chain

LLMVECT Verification Protocol v2.0 — Last updated: 2026-05-02
All algorithms are open for audit. This page is the single source of truth.