Benchmarks
Benchmarks for Open Metal AI
Reproducible leaderboard tasks, each with a defined dataset, evaluation metric, baseline model, and submission rules. They give everyone a shared basis for comparing methods on metal-transformation problems.
10 results
Sort
Propose a benchmark
Propose a benchmark or submit to a leaderboard
Benchmarks are open and community-governed. Propose a new task, contribute a submission, or discuss metric design with other builders.