Ml Model Eval Benchmark
by Muhammad Mazhar Saeed
v0.1.0
Compare model candidates using weighted metrics and deterministic ranking outputs. Use for benchmark leaderboards and model promotion decisions.
86
Downloads
1
Versions
Latest Changes
Install Ml Model Eval Benchmark with One Click
Get a managed OpenClaw server and install this skill from your dashboard. No SSH, no Docker, no configuration needed.
Deploy with ClawHost